NanoDAPFAM / NanoDAPFAMOutTitlAbsToTitlAbs

Overview

NanoDAPFAMOutTitlAbsToTitlAbs is an English patent-family retrieval task where both source queries and target documents contain only title and abstract. Positives are DAPFAM OUT-domain citation relations, so target families are technically related but do not share IPC3 domain with the query family.

This is the most compact cross-domain DAPFAM split. It asks whether short patent summaries can retrieve other short summaries from different technical fields. The task is hard because both detailed claims and full descriptions are absent.

Details

What the Original Data Measures

DAPFAM uses family-level citation links as qrels and IPC3 overlap to define domain relations. OUT-domain positives are cited families outside the query's IPC3 domain. This split uses title and abstract fields for both source and target families.

The task measures cross-domain patent-summary retrieval, where the relevant relationship must be inferred from concise descriptions.

Observed Data Profile

This Nano split contains 200 queries, 10,000 documents, and 1,257 positive qrels. There are 159 multi-positive queries. Positives per query average 6.29, with a minimum of 1, median of 4.0, and maximum of 20. Queries average 786.61 characters, and documents average 777.94 characters.

The field symmetry removes long-document noise but also removes much of the evidence that could explain cross-domain relevance.

BM25 Evaluation Profile

BM25 reaches nDCG@10 of 0.0583, hit@10 of 0.1750, and recall@100 of 0.1527 with a top-500 candidate pool. The low score shows that exact title and abstract overlap is a weak signal for cross-domain citation retrieval.

BM25 may find candidates with similar words, but OUT-domain positives often use different domain vocabularies for related mechanisms or effects.

Dense Evaluation Profile

The dense harrier-oss-270m profile is strongest by top-rank metrics, with nDCG@10 of 0.0872, hit@10 of 0.3150, and recall@100 of 0.2045. Dense retrieval improves over BM25 by matching abstract technical meaning across domains.

The score remains modest because title-abstract text alone may not expose enough detail to identify cross-domain citation relevance.

Reranking Hybrid Evaluation Profile

The reranking_hybrid candidate subset reaches nDCG@10 of 0.0762, hit@10 of 0.2650, and recall@100 of 0.2188. It uses top-100 candidates with optional rank-101 safeguards; 76 rows contain 101 candidates and 76 safeguard-positive rows are recorded. Hybrid improves recall over dense but ranks positives lower at the top.

This indicates that lexical evidence occasionally adds missing positives, but dense retrieval is more reliable for top-10 ordering.

Metric Interpretation for Model Researchers

This is a compact cross-domain summary retrieval stress test. Low BM25 and dense scores show that patent title-abstract fields are often insufficient for easy OUT-domain discovery. Models need cross-domain semantic abstraction rather than keyword search.

The many safeguard rows show that candidate generation itself is difficult. A reranker cannot solve the task unless positives first enter the candidate pool.

Query and Relevance Type Tendencies

Queries and documents are title-abstract summaries. Positives are cross-domain cited families. Relevance may be based on analogy, shared technical effect, transferred mechanism, or problem-solution correspondence.

Representative Failure Modes

BM25 fails when domain vocabulary changes. Dense retrieval may find broad analogy but not the cited family. Hybrid retrieval can improve recall while still over-ranking candidates with superficial overlap.

Training Data That May Help

Useful training data includes cross-domain patent abstract citation retrieval, technology analogy search, and cross-IPC prior-art retrieval. Training should exclude NanoDAPFAM evaluation family IDs, positives, qrels, and duplicates.

Synthetic data should use compact title and abstract records from different domains, with positives that are cross-domain cited patent summaries.

Model Improvement Notes

Improving this task requires strong patent-summary embeddings that can represent function and effect independently of IPC vocabulary. Hard negatives should include same-word but non-cited summaries and different-word but functionally related positives.

Citation-graph-aware contrastive training may be particularly useful.

Example Data

Query	Positive document
bicycle handlebar grip a bicycle handlebar grip contains a plastic inner shell having a tubular shape and an outer surface; a fiber layer having an inner surface and an outer surface and includes a plurality of fibers interweaving with each other and a plurality of weaving gaps located between the fibers; a plastic layer enclosed around the outer surface of the fiber layer and combined with the fiber layer integrally and including a holding portion coated on the outer surface of the fiber layer,... [500 / 821 chars]	durable flexible membrane and method of making same a flexible membrane having a valuable combination of desirable properties is composed of a generally heavy, dense supporting and reinforcing reticulated base fabric constituted of thick, generally loose bundles of multiple continuous filaments arranged in a mechanically interengated reticular array having an overall weight within the range of about 3-12 oz/yd.sup.2 the continuous filaments being of a synthetic polymer having good dimensional stability and high resistance to heat and light; and a solidified base coating composition completely impregnating the interstices of the base fabric and as forming a continuous coating along the opposite sides thereof, the composition having as essential ingredients a pvc resin and a plasticizer therefor in the amount of 35-75% of the resin of a trimellitate ester which exhibits high resistance to separation from the resin and imparts to the coating high flexibility at temperatures at least as lo... [1,000 / 1,921 chars]
method for improving belt press dewatering a method for increasing the removal of a higher fraction of liquid from the press cake in any belt press is described. specifically, the invention incorporates a series of rollers that create multiple pinch points to compress the solid fraction while removing liquid. after each pinch point, the solid material is allowed to separate from the belt, fall by gravity, and repack so that more liquid can be released at each successive pinch point than is possi... [500 / 620 chars]	artificial human anti-factor b antibody problem to be solved: to provide novel engineered forms of a monoclonal antibody and antigen-binding fragments thereof that bind complement protein factor b and selectively inhibit the alternative complement pathway.solution: artificial human anti-factor b antibodies or antigen-binding fragments thereof are derived from murine monoclonal antibody 1379 "mab1379", which selectively binds factor b in the third short consensus repeat ("scr") domain and prevents formation of the c3bbb complex. [533 chars]
stitch distribution control system for tufting machines a stitch distribution control system for a tufting machine for controlling placement of yarns being fed to the needles of the tufting machine by yarn feed mechanisms to form a desired pattern. a backing material is fed through the tufting machine at an increased stitch rate as the needles are shifted according to calculated pattern steps. a series of loopers or hooks engage and pick loops of yarns from the needles. the yarn feed mechanisms... [500 / 647 chars]	method and apparatus for measuring direction or position of weft yarn of fabric the measurement of the pick or stitches course position in continuously moved fabrics involves examining at least one gap-shaped segment in a top illumination or transillumination. the width of the segment is small and its legnth long in comparison to the thickness of the picks. the brightness value inside the segment is divided into two stages or areas (light, dark). those sections within the segment, in which the value is continuously associated with one stage, are determined. the number or total length of the sections of s stage is determined or, alternatively the speed at which the sections of a stage move in the segment is determined and the drafting angle of the pick is deduced from this value. [789 chars]

Source Reference Table

Source	Role
DAPFAM: A Domain-Aware Family-level Dataset to benchmark cross domain patent retrieval	Source benchmark paper for family-level patent retrieval.
DAPFAM DOI record	DOI record for the DAPFAM paper.
datalyes/DAPFAM_patent	Public source dataset card.
hakari-bench/NanoDAPFAM	Nano benchmark dataset containing this split.

Dataset Information

Field	Value
Nano set	NanoDAPFAM
Backing dataset	NanoDAPFAM
Task / split	NanoDAPFAMOutTitlAbsToTitlAbs
Hugging Face dataset	hakari-bench/NanoDAPFAM
Language	en
Category	natural_language
Queries	200
Documents	10,000
Positive qrels	1,257
Positives / query avg	6.29
Positives / query min	1
Positives / query median	4.00
Positives / query max	20
Multi-positive queries	159 (79.50%)
Query length avg chars	786.61
Document length avg chars	777.94

Candidate Subsets

Profile	Config	nDCG@10	Hit@10	Recall@100	Candidates
BM25	`bm25`	0.0583	0.1750	0.1527	top-500
Dense	`harrier_oss_v1_270m`	0.0872	0.3150	0.2045	top-500
Reranking hybrid	`reranking_hybrid`	0.0762	0.2650	0.2188	top-100

Training and Leakage Metadata

Original train split: not_confirmed
Evaluation split origin: DAPFAM OUT-domain title-abstract to title-abstract patent-family retrieval
Train/eval overlap audit: not_audited
Leakage note: exclude NanoDAPFAM evaluation family IDs, positives, qrels, and duplicates
Multi-positive training: citation_family_multi_positive
Useful training data: cross-domain patent abstract citation retrieval, technology analogy search, cross-IPC prior-art retrieval