HAKARI-Bench

NanoDAPFAM / NanoDAPFAMInTitlAbsToTitlAbsClm

Overview

NanoDAPFAMInTitlAbsToTitlAbsClm is an English patent-family retrieval task. The query contains a source patent family's title and abstract, and the target document contains title, abstract, and claims. Relevance is restricted to DAPFAM IN-domain citation relations, meaning positives share IPC3 technical class with the query family.

This split tests compact-query to claim-rich target retrieval inside the same technical domain. The target claims expose more detail than a short abstract, but they also introduce formal claim language and many same-domain distractors.

Details

What the Original Data Measures

DAPFAM uses patent-family citations as qrels and labels domain relations by IPC3 overlap. This split uses IN-domain positives and compares a title-abstract source representation against target records enriched with claims.

The task measures whether a short patent summary can retrieve same-domain cited families when candidate documents include detailed claim language.

Observed Data Profile

This Nano split contains 200 queries, 10,000 documents, and 3,072 positive qrels. There are 194 multi-positive queries. Positives per query average 15.36, with a minimum of 1, median of 18.0, and maximum of 20. Queries average 771.27 characters, and target documents average 7,226.36 characters.

The target claims provide more evidence than title-abstract records, but also more repeated legal vocabulary and term overlap among same-domain candidates.

BM25 Evaluation Profile

BM25 reaches nDCG@10 of 0.3593, hit@10 of 0.8500, and recall@100 of 0.5160 with a top-500 candidate pool. Claims on the target side improve lexical coverage because query terms may appear in target claim elements.

BM25 still trails dense and hybrid methods. Same-domain claims often share broad terminology, so lexical overlap alone is not enough to identify citation-related families.

Dense Evaluation Profile

The dense harrier-oss-270m profile reaches nDCG@10 of 0.4125, hit@10 of 0.8700, and recall@100 of 0.6452. Dense retrieval improves positive coverage by matching summary-level invention meaning to detailed target claims.

Dense retrieval is useful when the target claim structure expresses the same technical contribution with different wording from the source abstract.

Reranking Hybrid Evaluation Profile

The reranking_hybrid candidate subset is strongest by nDCG@10 and hit@10, reaching 0.4220 and 0.8850 respectively, with recall@100 of 0.6413. It uses top-100 candidates with optional rank-101 safeguards; five rows contain 101 candidates and five safeguard-positive rows are recorded. Dense retrieval has slightly higher recall@100, while hybrid has better top-10 ordering.

This suggests that exact target claim terms and dense semantic matching are both valuable. Hybrid ranking helps place relevant same-domain families higher when the target claims contain useful lexical anchors.

Metric Interpretation for Model Researchers

This is a hybrid-friendly in-domain patent retrieval task. Target claims add useful lexical and semantic detail, so combining BM25 and dense candidates can improve top ranking. Dense retrieval remains slightly better for broad positive coverage.

As with other DAPFAM splits, multi-positive recall matters. A strong system should recover many same-domain cited families, not just the easiest one.

Query and Relevance Type Tendencies

Queries are title-abstract summaries. Documents contain title, abstract, and claims. Positives are cited families in the same IPC3 domain. Target claims often contain the detailed component or process evidence missing from the short query.

Representative Failure Modes

BM25 may over-rank target claims that repeat query terms but describe a different invention. Dense retrieval may retrieve semantically close same-domain patents without citation relation. Hybrid retrieval can improve top ranking but still confuse claim-rich distractors.

Training Data That May Help

Useful training data includes same-domain title-abstract to patent-claims retrieval, IPC-restricted citation prediction, and prior-art search with claim-rich targets. Training should exclude NanoDAPFAM evaluation families, positives, qrels, and same-family duplicates.

Synthetic data should pair compact source title-abstract summaries with same-domain target records containing title, abstract, and claims.

Model Improvement Notes

Improving this task requires summary-to-claim alignment. Models should connect abstract-level invention descriptions to specific claim elements and should down-weight generic claim boilerplate.

For reranking, claim-element matching and citation-style dependency features are likely useful.

Example Data

QueryPositive document
snow removal equipment with automatic walking function the invention relates to snow removal equipment with an automatic walking function. the snow removal equipment comprises a walking module, a working module and a control module, wherein the walking module drives a snow removal machine to move; the working module comprises a working motor and a snow throwing mechanism driven by the working motor, and the snow throwing mechanism collects and throws out snows and occluded foreign substances on... [500 / 988 chars]multifunctional device for clearing snow an apparatus and method for clearing an accumulation of matter from a surface that includes a blade configured to collect matter upon movement of the apparatus and means to shift the collected matter and distribute it laterally from the apparatus. the apparatus may include a plurality of helically arranged ribbons adjacent the blade formed such that the radial distance from a central axis decreases away from a center portion of the blade. a sealed rotor chamber may also include a rotor with blades that are adjustable in a radial direction or flared in both first and second rotational directions. claims what is claimed is: 1. an apparatus for clearing an accumulation of matter from a surface, including: a blade configured to collect matter upon movement of the apparatus, the blade including a central portion and lateral portions; a first ribbon and a second ribbon located adjacent the blade, the first ribbon and the second ribbon arranged helical... [1,000 / 12,229 chars]
waste disposal devices waste disposal device including a housing defining a waste compartment for receiving enclosed waste and arranged to removably receive a cartridge containing a length of flexible tubing which operatively receives waste therein, a retention mechanism for holding a quantity of waste received in the tubing and a rotation mechanism for rotating the retention mechanism when the quantity of waste is held thereby and while the cartridge is stationary in order to twist the tubing a... [500 / 891 chars]cassette for dispensing pleated tubing a cassette for use in dispensing a pleated tubing. the cassette includes an annular body having a generally u shaped housing with an open central cylindrical core. the annular body includes an inner wall, an angular wall a bottom wall and an outer wall. the annular cover has an outer wall and a ledge that extends radially inward from the outer wall and over the annular body that defines a gap between an inner edge of the ledge and the inner wall of the annular body. an inter-engagement mechanism is provided on the annular body and on opposite edges of the annular cover that cooperates to secure the cover to the body. at least one aperture is provided in the angular wall to enable ventilation of the air. a cassette (10) for use in dispensing a pleated tubing (50), comprising: an annular body (20) having a housing with a generally u-shaped channel cross section, the housing having a central cylindrical core (27); and an annular cover (40) extending... [1,000 / 3,454 chars]
an article including identification for use in an electrically heated smoking system. there is provided an electrically heated smoking system (101) for receiving a smoking article (115) or cleaning article (205) configured for use with the smoking system (101). the system comprises a cavity (111) for at least partially receiving the smoking article (115) or cleaning article (205). the smoking article includes identification information printed thereon. the cleaning article includes identificatio... [500 / 1,164 chars]apparatus for generating aerosol from an aerosolisable medium, an article of aerosolisable medium and a method of determining a parameter of an article to provide an apparatus that heats an aerosolizable medium to volatilize at least one component of the aerosolizable medium, without burning or combusting the aerosolizable medium.solution: an apparatus 100 comprises: a housing; a chamber 112 for receiving an article 102 comprising an aerosolizable medium and including a marker; and a controller 116. the controller is configured to receive: a first input indicative of a rate of movement of the article, received in use, in the chamber; and a second input indicative of a parameter of the article. at least the second input is determined based on the marker.selected drawing: figure 3 claims 1. an apparatus for generating aerosol from an aerosolisable medium, the apparatus comprising: a housing; a chamber for receiving an article comprising aerosolisable medium and including a marker; a cont... [1,000 / 5,466 chars]

Source Reference Table

SourceRole
DAPFAM: A Domain-Aware Family-level Dataset to benchmark cross domain patent retrievalSource benchmark paper for family-level patent retrieval.
DAPFAM DOI recordDOI record for the DAPFAM paper.
datalyes/DAPFAM_patentPublic source dataset card.
hakari-bench/NanoDAPFAMNano benchmark dataset containing this split.

Dataset Information

FieldValue
Nano setNanoDAPFAM
Backing datasetNanoDAPFAM
Task / splitNanoDAPFAMInTitlAbsToTitlAbsClm
Hugging Face datasethakari-bench/NanoDAPFAM
Languageen
Categorynatural_language
Queries200
Documents10,000
Positive qrels3,072
Positives / query avg15.36
Positives / query min1
Positives / query median18.00
Positives / query max20
Multi-positive queries194 (97.00%)
Query length avg chars771.27
Document length avg chars7,226.36

Candidate Subsets

ProfileConfignDCG@10Hit@10Recall@100Candidates
BM25bm250.35930.85000.5160top-500
Denseharrier_oss_v1_270m0.41250.87000.6452top-500
Reranking hybridreranking_hybrid0.42200.88500.6413top-100

Training and Leakage Metadata