Common XML Attributes

The following table describes attributes that can be found in most of the data containers.

Attribute

Description

ucid

Unique document identifier based on country, doc-number, and kind, e.g. US-96142365-A

Note: the doc-number may be an application or publication number. See application-reference and publication-reference
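As a rough illustration of the ucid structure described above, the following Python sketch splits a ucid into its three parts. The names `Ucid` and `split_ucid` are hypothetical helpers, not part of CLAIMS Direct. The sketch assumes that the country and kind parts contain no hyphens, and it splits on the first and last hyphen so that a doc-number containing a hyphen would stay in one piece.

```python
from typing import NamedTuple


class Ucid(NamedTuple):
    """Hypothetical container for the three parts of a ucid."""
    country: str
    doc_number: str
    kind: str


def split_ucid(ucid: str) -> Ucid:
    """Split a ucid such as 'US-96142365-A' into country, doc-number, kind.

    Assumption: country and kind contain no hyphens themselves.
    """
    # Everything before the first hyphen is taken as the country.
    country, first_sep, rest = ucid.partition("-")
    # Everything after the last hyphen is taken as the kind.
    doc_number, last_sep, kind = rest.rpartition("-")
    if not (first_sep and last_sep and country and doc_number and kind):
        raise ValueError(f"not a country-docnumber-kind ucid: {ucid!r}")
    return Ucid(country, doc_number, kind)
```

Note that the ucid alone does not say whether the doc-number is an application or a publication number; that still has to come from the surrounding application-reference or publication-reference.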

mxw-id

Internal record-level identifier

created-load-id

Identifies the batch of records in which the document was added to the database

modified-load-id

Identifies the most recent batch of records in which the document was modified

load_source

Identifies the source of the data loaded into Alexandria. Values include: 

patent-office: Identifies documents published by PTOs and documents from third-party providers who receive data from PTOs
docdb: Weekly updates from the EPO product DOCDB/INPADOC 
google: English translations from Google's translation service
translated: Human-translated content 
mxw-smt: Data translated by Statistical Machine Translation (SMT) 
us-assign: Reassignment data from the USPTO 
inpadoc-ls: Legal status events from the EPO INPADOC service 
ipcr and mcf: Reclassification files from EPO and USPTO, respectively
ifi: Value-added data from IFI CLAIMS processing

status

Internal attribute used in update procedures. Values include:

    • new
    • corrected
    • deleted

The "corrected" status means that the document has been updated since the initial load. This could be a change in assignment, new classifications, or other updates. These changes can come from curation work IFI does to standardize assignees and track current assignments, changes in legal status, or other updates from the patent office.

The "deleted" status can mean one of two things:

  1. The originating data provider (patent office or third party) has requested that we remove a document.
  2. The originating data provider (patent office or third party) has changed the makeup of the ucid either by remapping a kind code or changing the format of the publication number.

In both cases, CLAIMS Direct sets status=deleted and deleted_load_id=<load-id> and removes all data in all satellite tables, although deleted records are indexed. Documents marked @status=deleted should never be extracted from your CLAIMS Direct instance; if a document is marked as deleted after it has already been extracted, it should be removed from downstream processing.
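The handling of deleted documents can be sketched as a filter applied before downstream processing. This is a minimal sketch, assuming each document is available as an XML string whose root element carries the `ucid` and `status` attributes. The function name `partition_by_status`, the root element name `patent-document` in the usage example, and the second sample ucid are illustrative assumptions.

```python
import xml.etree.ElementTree as ET
from typing import Iterable, List, Tuple


def partition_by_status(docs: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Return (ucids to keep, ucids to purge from downstream systems).

    Assumption: `ucid` and `status` are attributes of the root element.
    """
    keep: List[str] = []
    purge: List[str] = []
    for xml_text in docs:
        root = ET.fromstring(xml_text)
        ucid = root.get("ucid", "")
        # Deleted documents are never extracted; if they were extracted
        # earlier, they are queued for removal downstream instead.
        if root.get("status") == "deleted":
            purge.append(ucid)
        else:
            keep.append(ucid)
    return keep, purge
```

A usage example with two made-up records:

```python
sample = [
    '<patent-document ucid="US-96142365-A" status="new"/>',
    '<patent-document ucid="US-1234567-A" status="deleted"/>',
]
keep, purge = partition_by_status(sample)
```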

ref-ucid

ucid of related document used for reference data, such as a PCT application

format

Designates the normalized or non-normalized format of the following document-id:

Format

Description

Example(s)

epo

Standardized name according to the DOCDB file, all caps, no punctuation, limited to 30 characters.
For the names of individuals, the format is LAST_NAME FIRST_NAME MIDDLE_NAME/INITIAL.
See the EPO website for DOCDB's list of standardized names.

1. IBM 
2. FINLAND TELECOM OY 
3. THE UNITED STATES GOVERNMENT
4. TUPPER ALAN WILLIAM

intermediate

Pre-standardized name, converted to all caps.
For the names of individuals, the format is LAST_NAME, FIRST_NAME MIDDLE_NAME/INITIAL.

1. INTERNATIONAL BUSINESS MACHINES
2. TELECOM FINLAND OY
3. DEPARTMENT OF THE NAVY
4. TUPPER, ALAN WILLIAM

original

Name as filed, provided directly from the publishing source (can be in non-Latin characters).
For the names of individuals, the format is Last_name, First_name Middle_name/initial.

1. International Business Machines Corporation 
2. Sonera Oyj 
3. The United States Government as represented by the Department of the Navy
4. Tupper, Alan William

original-translated

Translation of name provided by patent authority

standard

Deprecated

uspto

Deprecated
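
The surface constraints listed for the epo format (all caps, no punctuation, at most 30 characters) can be checked with a short Python sketch. The function name `looks_like_epo_format` is hypothetical, and the check only tests those three constraints using ASCII punctuation; it cannot tell whether a name is actually the DOCDB standardized name.

```python
import string

EPO_MAX_LENGTH = 30  # length limit stated for the epo format


def looks_like_epo_format(name: str) -> bool:
    """True if name is all caps, free of ASCII punctuation, and <= 30 chars."""
    if not name or len(name) > EPO_MAX_LENGTH:
        return False
    has_punctuation = any(ch in string.punctuation for ch in name)
    # Comparing with the upper-cased copy rejects any lowercase letter.
    return not has_punctuation and name == name.upper()
```

Applied to the examples in the table, the epo names such as FINLAND TELECOM OY and TUPPER ALAN WILLIAM pass, while the intermediate name TUPPER, ALAN WILLIAM fails because of the comma and the original name Sonera Oyj fails because of its lowercase letters.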