Common XML Attributes

The following table describes attributes that can be found in most of the data containers.

Attribute

Description

ucid

Unique document identifier based on country, doc-number, and kind, i.e. US-96142365-A

Note: the doc-number may be an application or publication number. See application-reference and publication-reference

mxw-id

Internal record-level identifier

load_source

Identifies the source of the data loaded into Alexandria. Values include: 

patent-office: Identifies documents published, usually weekly, by PTOs 
docdb: Weekly updates from the EPO product DocDB/Inpadoc 
google: English translations from Google's translation service
translated: Human-translated content 
mxw-smt: Data translated by Statistical Machine Translation (SMT) 
us-assign: Reassignment data from the USPTO 
inpadoc-ls: Legal status events from the EPO Inpadoc service 
ipcr and mcf: Reclassification files from EPO and USPTO, respectively
ifi: Value-added data from IFI CLAIMS processing

status

Internal attribute used in update procedures. Values include:

    • new
    • corrected
    • deleted

A deleted status can mean one of two things:

  1. The originating data provider (patent office or third party) has requested that we remove a document.
  2. The originating data provider (patent office or third party) has changed the makeup of the ucid either by remapping a kind code or changing the format of the publication number.

In both of these cases, CLAIMS Direct sets the status=deleted, deleted_load_id=<load-id> and removes all data in all satellite tables, although deleted records are indexed. Documents marked @status=deleted should never be extracted from your CLAIMS Direct instance or, if they are marked at a later date, should be removed from downstream processing.

ref-uciducid of related document used for reference data, such as a PCT application
format

Designates the normalized or not-normalized format of the following document-id:

FormatDescriptionExample(s)
epo

Standardized name according to the DocDB file, all caps, no punctuation, limited to 30 characters.
For the names of individuals, the format is LAST_NAME FIRST_NAME MIDDLE_NAME/INITIAL.
See the EPO website for DocDB's list of standardized names.

1. IBM 
2. FINLAND TELECOM OY 
3. THE UNITED STATES GOVERNMENT
4. TUPPER ALAN WILLIAM

intermediatePre-standardized name, converted to all caps.
For the names of individuals, the format is LAST_NAME, FIRST_NAME MIDDLE_NAME/INITIAL.
1. INTERNATIONAL BUSINESS MACHINES 
2. TELECOM FINLAND OY 
3. DEPARTMENT OF THE NAVY
4. TUPPER, ALAN WILLIAM
original

Name as filed, provided directly from the publishing source (can be in non-Latin characters).
For the names of individuals, the format is Last_name, First_name Middle_name/initial.

1. International Business Machines Corporation 
2. Sonera Oyj 
3. The United States Government as represented by the Department of the Navy
4. Tupper, Alan William

original-translatedTranslation of name provided by patent authority