HAKARI-Bench

NanoDAPFAM / NanoDAPFAMInTitlAbsToTitlAbs

Overview

NanoDAPFAMInTitlAbsToTitlAbs is an English patent-family retrieval task in which both source queries and target documents contain only the title and abstract. Positives are restricted to DAPFAM IN-domain citation relations, so every positive target family shares an IPC3 technical class with the query family.

This is a compact same-domain patent-summary retrieval task. It resembles title-abstract prior-art search in a known technical class. Since both sides are short summaries, the task is less noisy than full-text retrieval but provides fewer details for distinguishing cited families.

Details

What the Original Data Measures

DAPFAM measures patent-family retrieval with citation qrels and domain labels. The IN condition keeps citation-related families that share an IPC3 domain with the source family. This split uses the title and abstract fields on both sides, making it a summary-to-summary version of in-domain patent retrieval.

Observed Data Profile

This Nano split contains 200 queries, 10,000 documents, and 3,066 positive qrels. There are 194 multi-positive queries. Positives per query average 15.33, with a minimum of 1, median of 18.0, and maximum of 20. Queries average 771.27 characters, and documents average 777.74 characters.
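The profile figures above can be recomputed from a qrels list. The following is a minimal sketch, assuming qrels are available as (query_id, doc_id) pairs; the function name and the toy IDs are hypothetical and are not rows from NanoDAPFAM.

```python
from collections import Counter
from statistics import mean, median


def positives_per_query_profile(qrels):
    """Summarize positives per query from (query_id, doc_id) pairs.

    Queries with zero positives never appear in qrels, so they are not counted.
    """
    # Deduplicate pairs first so a repeated qrel is not counted twice.
    counts = Counter(query_id for query_id, _ in set(qrels))
    per_query = list(counts.values())
    return {
        "queries": len(per_query),
        "positive_qrels": sum(per_query),
        "avg": round(mean(per_query), 2),
        "min": min(per_query),
        "median": median(per_query),
        "max": max(per_query),
        "multi_positive": sum(1 for c in per_query if c > 1),
    }


# Toy qrels with hypothetical IDs, used only to exercise the function.
toy_qrels = [
    ("q1", "d1"), ("q1", "d2"), ("q1", "d3"),
    ("q2", "d4"),
    ("q3", "d5"), ("q3", "d6"),
]
profile = positives_per_query_profile(toy_qrels)
print(profile)

# Consistency check on the reported figures: 3,066 qrels over 200 queries,
# and 194 multi-positive queries out of 200.
reported_avg = round(3066 / 200, 2)
reported_multi_share = round(100 * 194 / 200, 2)
print(reported_avg, reported_multi_share)
```

The last two values agree with the reported average of 15.33 positives per query and the 97.00% multi-positive share.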

The field symmetry makes this split useful for isolating semantic summary matching without long-document artifacts.

BM25 Evaluation Profile

BM25 reaches nDCG@10 of 0.3386, hit@10 of 0.8650, and recall@100 of 0.4677 with a top-500 candidate pool. Exact technical terms in titles and abstracts are useful, especially because positives share the same domain.

BM25 still has limited recall. Same-domain abstracts may share terms even when not citation-related, and cited families may use different terminology for the same technical contribution.

Dense Evaluation Profile

The dense harrier-oss-270m profile reaches nDCG@10 of 0.3923, hit@10 of 0.8750, and recall@100 of 0.6093. Dense retrieval improves recall@100 substantially over BM25 (0.6093 versus 0.4677) by matching semantic relatedness between compact invention summaries.

Dense retrieval is helpful for paraphrased same-domain relevance, but it must still distinguish citation positives from other abstracts in the same field.

Reranking Hybrid Evaluation Profile

The reranking_hybrid candidate subset reaches nDCG@10 of 0.3942, hit@10 of 0.8700, and recall@100 of 0.5985. It uses top-100 candidates with optional rank-101 safeguards; nine rows contain 101 candidates and nine safeguard-positive rows are recorded. Hybrid slightly leads on nDCG@10 (0.3942 versus 0.3923 for dense), while dense leads on hit@10 and recall@100.

The close scores show that lexical and dense signals overlap substantially in this compact representation. Hybrid can improve top ordering for some queries, but dense retrieval provides slightly broader positive coverage.
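The source does not specify how the reranking_hybrid candidates are fused or how the rank-101 safeguard is triggered. The sketch below is one plausible reading, not the benchmark's implementation: it assumes reciprocal rank fusion of the BM25 and dense rankings, and it assumes the safeguard appends a single positive at rank 101 when the top 100 contain none. The function names, the fusion constant k=60, and the toy IDs are all hypothetical.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of doc IDs with reciprocal rank fusion."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties are broken by doc ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def build_candidates(bm25_ranking, dense_ranking, positives, top_n=100):
    """Return top_n fused candidates, plus one safeguard positive if needed.

    ASSUMPTION: the safeguard adds one positive at rank top_n + 1 only when
    no positive made it into the top_n. The benchmark may define it differently.
    """
    fused = reciprocal_rank_fusion([bm25_ranking, dense_ranking])
    candidates = fused[:top_n]
    if positives and not any(d in positives for d in candidates):
        # Prefer the best-ranked positive outside the cut-off, if one exists.
        extra = next((d for d in fused[top_n:] if d in positives), None)
        if extra is None:
            extra = sorted(positives)[0]
        candidates = candidates + [extra]
    return candidates


# Toy example with top_n=3 standing in for top-100.
bm25 = ["a", "b", "c", "p"]
dense = ["b", "a", "c", "p"]
with_safeguard = build_candidates(bm25, dense, positives={"p"}, top_n=3)
without_safeguard = build_candidates(bm25, dense, positives={"a"}, top_n=3)
print(with_safeguard, without_safeguard)
```

Under this reading, a row with 101 candidates corresponds to a query whose fused top 100 contained no positive, which would be consistent with the nine 101-candidate rows matching the nine safeguard-positive rows.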

Metric Interpretation for Model Researchers

This is a compact in-domain patent retrieval task with multi-positive qrels. It is a good setting for evaluating patent-summary embeddings without the confound of very long targets or claim-heavy text.

The key metric distinction is that BM25 has a high hit@10 (0.8650) but a much lower recall@100 than dense (0.4677 versus 0.6093). It often places at least one related family in the top 10 but misses much of the citation set.
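The gap between hit@10 and recall@100 follows from how the metrics are defined on multi-positive queries. The sketch below assumes binary relevance and the standard log2 discount for nDCG; the benchmark's exact evaluation code is not shown in this card, so the function names and the toy ranking are illustrative only.

```python
import math


def hit_at_k(ranked, positives, k=10):
    """1.0 if at least one positive appears in the top k, else 0.0."""
    return 1.0 if any(d in positives for d in ranked[:k]) else 0.0


def recall_at_k(ranked, positives, k=100):
    """Fraction of all positives that appear in the top k."""
    if not positives:
        return 0.0
    return len(set(ranked[:k]) & positives) / len(positives)


def ndcg_at_k(ranked, positives, k=10):
    """Binary-relevance nDCG with a log2 rank discount."""
    dcg = sum(
        1.0 / math.log2(rank + 2)
        for rank, d in enumerate(ranked[:k])
        if d in positives
    )
    ideal_hits = min(len(positives), k)
    idcg = sum(1.0 / math.log2(rank + 2) for rank in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0


# Toy query with four positives, only one of which is retrieved (at rank 2).
positives = {"p1", "p2", "p3", "p4"}
ranked = ["n0", "p1"] + [f"n{i}" for i in range(1, 99)]  # 100 candidates

hit = hit_at_k(ranked, positives)        # 1.0: one positive is in the top 10
recall = recall_at_k(ranked, positives)  # 0.25: one of four positives found
ndcg = ndcg_at_k(ranked, positives)
print(hit, recall, ndcg)
```

A single early positive is enough to score a hit, while recall and nDCG are both penalized for the three positives that never surface. With a median of 18 positives per query, this effect is much stronger on the real split.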

Query and Relevance Type Tendencies

Queries and documents are title-abstract summaries. Positives are same-domain citation-related families. The short records emphasize invention summaries, problem statements, and technical effects.

Representative Failure Modes

BM25 may rank abstracts with shared terminology but no citation link. Dense retrieval may over-rank broadly similar summaries in the same IPC3 area. Hybrid retrieval may not add much when both signals retrieve similar same-domain candidates.

Training Data That May Help

Useful training data includes same-domain title-abstract patent citation retrieval, IPC-restricted patent semantic search, and prior-art retrieval over compact patent summaries. Training should exclude NanoDAPFAM evaluation families, positives, and qrels.
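The exclusion rule above can be applied as a simple filter over training pairs. This is a minimal sketch, assuming training data is a list of (query_family_id, positive_family_id) pairs and that the set of evaluation family IDs has been collected beforehand from the NanoDAPFAM queries, documents, and qrels; the function name and all IDs are hypothetical.

```python
def filter_training_pairs(pairs, eval_family_ids):
    """Drop every training pair that touches an evaluation family on either side."""
    blocked = set(eval_family_ids)
    return [
        (query_family, positive_family)
        for query_family, positive_family in pairs
        if query_family not in blocked and positive_family not in blocked
    ]


# Hypothetical family IDs: "E*" stand for evaluation families, "F*" for others.
training_pairs = [
    ("F1", "F2"),
    ("F3", "E1"),  # positive side is an evaluation family
    ("E2", "F4"),  # query side is an evaluation family
    ("F5", "F6"),
]
clean_pairs = filter_training_pairs(training_pairs, {"E1", "E2"})
print(clean_pairs)
```

Filtering on both sides of the pair is the conservative choice: it removes evaluation families even when they appear only as training positives.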

Synthetic data should generate short title and abstract patent records in shared technical classes, with positives drawn from cited same-domain families.

Model Improvement Notes

Improving on this task requires fine-grained summary embeddings for patents. Models should capture the technical problem, solution, effect, and domain without relying only on shared class vocabulary.

For reranking, citation-style relation modeling may help separate true positives from same-topic summaries.

Example Data

| Query | Positive document |
| --- | --- |
| snow removal equipment with automatic walking function the invention relates to snow removal equipment with an automatic walking function. the snow removal equipment comprises a walking module, a working module and a control module, wherein the walking module drives a snow removal machine to move; the working module comprises a working motor and a snow throwing mechanism driven by the working motor, and the snow throwing mechanism collects and throws out snows and occluded foreign substances on... [500 / 988 chars] | multifunctional device for clearing snow an apparatus and method for clearing an accumulation of matter from a surface that includes a blade configured to collect matter upon movement of the apparatus and means to shift the collected matter and distribute it laterally from the apparatus. the apparatus may include a plurality of helically arranged ribbons adjacent the blade formed such that the radial distance from a central axis decreases away from a center portion of the blade. a sealed rotor chamber may also include a rotor with blades that are adjustable in a radial direction or flared in both first and second rotational directions. [643 chars] |
| waste disposal devices waste disposal device including a housing defining a waste compartment for receiving enclosed waste and arranged to removably receive a cartridge containing a length of flexible tubing which operatively receives waste therein, a retention mechanism for holding a quantity of waste received in the tubing and a rotation mechanism for rotating the retention mechanism when the quantity of waste is held thereby and while the cartridge is stationary in order to twist the tubing a... [500 / 891 chars] | cassette for dispensing pleated tubing a cassette for use in dispensing a pleated tubing. the cassette includes an annular body having a generally u shaped housing with an open central cylindrical core. the annular body includes an inner wall, an angular wall a bottom wall and an outer wall. the annular cover has an outer wall and a ledge that extends radially inward from the outer wall and over the annular body that defines a gap between an inner edge of the ledge and the inner wall of the annular body. an inter-engagement mechanism is provided on the annular body and on opposite edges of the annular cover that cooperates to secure the cover to the body. at least one aperture is provided in the angular wall to enable ventilation of the air. [751 chars] |
| an article including identification for use in an electrically heated smoking system. there is provided an electrically heated smoking system (101) for receiving a smoking article (115) or cleaning article (205) configured for use with the smoking system (101). the system comprises a cavity (111) for at least partially receiving the smoking article (115) or cleaning article (205). the smoking article includes identification information printed thereon. the cleaning article includes identificatio... [500 / 1,164 chars] | apparatus for generating aerosol from an aerosolisable medium, an article of aerosolisable medium and a method of determining a parameter of an article to provide an apparatus that heats an aerosolizable medium to volatilize at least one component of the aerosolizable medium, without burning or combusting the aerosolizable medium.solution: an apparatus 100 comprises: a housing; a chamber 112 for receiving an article 102 comprising an aerosolizable medium and including a marker; and a controller 116. the controller is configured to receive: a first input indicative of a rate of movement of the article, received in use, in the chamber; and a second input indicative of a parameter of the article. at least the second input is determined based on the marker.selected drawing: figure 3 [789 chars] |

Source Reference Table

| Source | Role |
| --- | --- |
| DAPFAM: A Domain-Aware Family-level Dataset to benchmark cross domain patent retrieval | Source benchmark paper for family-level patent retrieval. |
| DAPFAM DOI record | DOI record for the DAPFAM paper. |
| datalyes/DAPFAM_patent | Public source dataset card. |
| hakari-bench/NanoDAPFAM | Nano benchmark dataset containing this split. |

Dataset Information

| Field | Value |
| --- | --- |
| Nano set | NanoDAPFAM |
| Backing dataset | NanoDAPFAM |
| Task / split | NanoDAPFAMInTitlAbsToTitlAbs |
| Hugging Face dataset | hakari-bench/NanoDAPFAM |
| Language | en |
| Category | natural_language |
| Queries | 200 |
| Documents | 10,000 |
| Positive qrels | 3,066 |
| Positives / query avg | 15.33 |
| Positives / query min | 1 |
| Positives / query median | 18.00 |
| Positives / query max | 20 |
| Multi-positive queries | 194 (97.00%) |
| Query length avg chars | 771.27 |
| Document length avg chars | 777.74 |

Candidate Subsets

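| Profile | Config | nDCG@10 | Hit@10 | Recall@100 | Candidates |
| --- | --- | --- | --- | --- | --- |
| BM25 | bm25 | 0.3386 | 0.8650 | 0.4677 | top-500 |
| Dense | harrier_oss_v1_270m | 0.3923 | 0.8750 | 0.6093 | top-500 |
| Reranking hybrid | reranking_hybrid | 0.3942 | 0.8700 | 0.5985 | top-100 |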

Training and Leakage Metadata