You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 8 Next »

XML file format

1. Introduction

The overall scope of BMDW development phase two is to receive more data from our listing organisations and to make these data available through our Search & Match Service. However, old format (DOT20) is not an appropriate format when you have many different fields. therefore, we had to move to another file format. The new file format is an XML (Extensible Markup Language) file which is considered an industry standard that is extendable, robust and easy to use. 

Several people from the community formed a working group to create the required XML Sschema Definition (XSD) files. These files define the elements that are allowed in the XML file, the order of the elements and the values that will be accepted. The names of the elements are based upon EMDIS specifications and aligns with the EMDIS Data Dictionary when appropriate. Several elements are basic elements that should be included in all files, but there are also elements that are specific for only donors or only cord blood units (CBUs).

We will now explain the composition of the XML file and how you should use the XSD reference files.

2. XSD schema files

We provided two XSD sxhema files that define the structure of your XML file basicTypes.xsd and Inventories.xsd.

The Inventories file describes the structure of the XML file and the order of the elements. Here you can also find if a certain field is mandatory or not (minOccurs="0"-> not mandatory). This file includes many "complexType" : an XML element that contains other elements and/or attributes. In the file you can see that the values of the elements can be defined here, like the elements GRID and ID, or that after the name of the field a "type" is defined. For example for the element with name BIRTH_DATE you see type="bareDateType". The definition of "bareDateType" is described in the basicTypes.xsd file.

 We will now describe the global structure of the XML file and the elements.

2.1 InventoryType elements

 

Field Identifier

Required

Description

Type

Length

Comment

CREATION_TIMEYesCreation time stamp of the inventories (in UTC)dateTimeminimal 20

Without fractional seconds the length is 20, for example: 2016-08-23T13:16:48Z.

 Additional notes: CREATION_TIME is defined as "Creation time stamp of the <INVENTORIES>" that means the time in UTC when the complete and valid file was finally created at the registry. This can be the same as SNAPSHOT_TIME.

LISTING_ORGANIZATIONYesOrganisation that lists the donor/cbu provided as IONionType: number between 1000 and 99994Issuing Organisation Number (ION) allocated by ICBBA. This can be different from the POOL when another organisation is sending the data to BMDW.
POOLYesPhysical location of the donors/CBUs of the inventory provided as IONionType: number between 1000 and 99994Physical location of the donors/CBUs of the inventory provided as ION.
CONTENT_TYPEYesType of the inventory items, i.e. donor ("D") or CBU ("C")contentTypeType1The content-type is also shown in the fileName. When CONTENT_TYPE is "D", the INVENTORY must contain <DONOR>-blocks. When CONTENT_TYPE is "C", the INVENTORY must contain <CBU>-blocks.
UPDATE_MODEYesUpdate mode of the inventory, i.e. FULL or DIFFupdateModeType4Only UPDATE_MODE "FULL" is currently supported. Always the complete inventory should be send.
SNAPSHOT_TIMENoTimestamp of the 'data snapshot' (in UTC)dateTimeminimal 20

Without fractional seconds the length is 20, for example: 2016-08-23T13:16:48Z

Additional notes: SNAPSHOT_TIME in the element <INVENTORY> is defined as "timestamp of the data snapshot in UTC" that means the timestamp of the creation of this part of the complete file. This can be the timestamp of the XML export and I guess that in most of the cases it will be identical to the CREATION_TIME.

SCHEMA_VERSIONYesVersion of the applied XML Schema Definition (XSD)schemaVersionType The schema version is very important as this determines the validation rules that should be applied during the processing of your file.

 

2.2 ItemBaseType elements (for Donors and CBUs)

 

Field Identifier

Required

Description

Type

Length

Comment

IDYesUnique identifier of the donor/CBUString17Unique identifier of the donor/CBU: The value comprises the EMDIS hub code + donor identification allocated by the associated donor registry, where the sending organisation is an EMDIS member, otherwise the two digit ISO country code of the associated donor registry + donor identification allocated by the associated donor registry. For example: AU600196166, DEGOE-35487, US087013165, SB45
GRIDNoGlobal registration identifier of the donor/CBUString19 
ATTRNoDescribing attribute of the donor/CBU according to house rules of the sending organization.String3 
BIRTH_DATEYesDate of birth of the donor/CBUbareDateType10Date without timezone information, example 1968-06-28, Date Delimiter = "-"
SEXNoBiological gender of the donor/CBUsexType1

sexType: "F","M"

NOTE: Mandatory for donors, optional for CBUs

ABONoBlood group (ABO) of the donor/CBUaboType2aboType: "A","B","O","AB"
RHESUSNoRhesus (Rh) factor of the donor/CBUrhesusType1

rhesusType: "P","N"

NOTE: "+" and "-" are not supported

ETHNNoEthnic group of the donor/CBUethnType4

ethnType: "AFNA","AFSS", "ASSW", "ASSO", "ASCE", "ASSE", "ASNE", "ASOC", "CAEU",
"CAER", "CANA", "CAAU", "HICA", "HISA", "AF", "AS", "CA", "HI", "MX", "OT","UK"

CCR5NoCCR5 status of the donor/CBUccr5Type2ccr5Type: "DD","WW","DW"
HLAYesHLA of the donor/cbuhlaType Explained separately at hlaType 2.3
KIRNoKIR genotype of the donor/CBUkirType Explained separately at kirType 2.4
IDMNoInfectious disease markers (IDM) and other relevant tests of the donor/CBUidmType Explained separately at idmType 2.5
RSV_PATNoUnique identifier of the patient the donor/CBU is reserved for (STATUS=RS).String17

The value comprises the EMDIS patient identification, where the patient search centre is an EMDIS member, otherwise the value is empty. For example: AU9654021, DE275342, US2277450.

NOTE: This field is not required for status "RS" and can be transmitted as empty if privacy concerns exist.

STATUSYesStatus of the donor/CBUstatusType2statusType: "AV","TU","RS" ("DE" is not supported yet, "RE" not valid for CBUs)
STAT_END_DATENoDate until which the current status will be applicablebareDateType10

Date without timezone information, example 1968-06-28, Date Delimiter = "-"

 

2.3 hlaType elements

HlaType fields can be divided in hlaSerFieldsType and hlaDnaFieldsType

hlaSerFieldsType: HLA values obtained by serological typing methods

hlaSerFieldsType = “<FIELD1>” string of max length 5 “</FIELD1>”, “<FIELD2>” string of max length 5 “</FIELD2>”;

Example: <SER><FIELD1>1</FIELD1><FIELD2>5</FIELD2></SER>

Serological typing results can be given for loci that are defined as hlaLocusType. These loci include HLA-A, -B, -C, -DRB1, -DQB1.

 

hlaDnaFieldsType: HLA values obtained by DNA based typing methods

hlaDnaFieldsType = “<FIELD1>” string of max length 20 “</FIELD1>”, “<FIELD2>” string of max length 20 “</FIELD2>”;

Exanple: <DNA><FIELD1>01:01</FIELD1><FIELD2>05:01</FIELD2></DNA>

DNA typing results can be given for loci that are defined as hlaLocusType and hlaLocusDnaOnlyType. These loci include HLA-A, -B, -C, -DRB1, -DQB1, -DRB3, -DRB4, -DRB5, -DQA1, -DPA1, -DPB1.

Finally, '01:XX' is equivalent to '01'. Both codes '01:XX' and '01' are allowed.

 

Minimal required elements

Minimal typing values for Donor: A (either SER or DNA), B (either SER or DNA)

Minimal typing values for CBU: A (either SER or DNA), B (either SER or DNA), DRB1 (either SER or DNA)

 

NOTES: - It is not possible anymore to submit string HLA values; only single values are allowed.

- When a donor or CBU has homozygous alleles/values, please use the following notation:

<HLA><A><SER><FIELD1>1</FIELD1><FIELD2 /></SER></A> ...
or
<DQB1><DNA><FIELD1>05:02:01G</FIELD1><FIELD2 /></DNA></DQB1>

 

Field Identifier

Required

Description

Type

Length

Comment

SERdepends on content type and DNA fields providedHLA values obtained by serological typing methodshlaSerFieldsType5Each SER element contains two other elements: FIELD1 and FIELD2
DNAdepends on content type and SER fields providedHLA values obtained by DNA based typing methodshlaDnaFieldsType20Each DNA element contains two other elements: FIELD1 and FIELD2
FIELD1 HLA value of allele 1 5 or 20Element within the element SER and DNA
FIELD2 HLA value of allele 2 5 or 20Element within the element SER and DNA
AYesHLA-A valueshlaLocusType Both SER and DNA possible; either SER or DNA values required
BYesHLA-B valueshlaLocusType Both SER and DNA possible; either SER or DNA values required
CNoHLA-C valueshlaLocusType Both SER and DNA possible
DRB1Yes (CBU) No (Donor)HLA-DRB1 valueshlaLocusType Both SER and DNA possible; either SER or DNA values required for CBU
DRB3NoHLA-DRB3 valueshlaLocusDnaOnlyType Only DNA possible
DRB4NoHLA-DRB4 valueshlaLocusDnaOnlyType Only DNA possible
DRB5NoHLA-DRB5 valueshlaLocusDnaOnlyType Only DNA possible
DQA1NoHLA-DQA1 valueshlaLocusDnaOnlyType Only DNA possible
DQB1NoHLA-DQB1 valueshlaLocusType Both SER and DNA possible
DPA1NoHLA-DPA1 valueshlaLocusDnaOnlyType Only DNA possible
DPB1NoHLA-DPB1 valueshlaLocusDnaOnlyType Only DNA possible

 

2.4 kirType elements

The kirType Field Definitions consists of the type: kirLocusType. This is defined as a String with 3 characters: "POS" or "NEG". "POS" means "Presence of KIR gene", "NEG" means "Absence of KIR gene".

The following elements are possible and in this specific order:

<KIR2DL1>,<KIR2DL2>,<KIR2DL3>,<KIR2DL4>,<KIR2DL5A>,<KIR2DL5B>,<KIR2DS1>,<KIR2DS2>,<KIR2DS3>,<KIR2DS4>,<KIR2DS5>,<KIR2DP1>,<KIR3DL1>,<KIR3DL2>,<KIR3DL3>,<KIR3DS1>,<KIR3DP1>.

There is another field called <KIR_GL> (URI that refers to a GL-string registered with a GL-service or direct GL-string for absence / presence) this field is not used at the moment and must be empty.

 

Field Identifier

Required

Description

Type

Length

Comment

KIR gene e.g. KIR2DL1NoKIR genotype e.g. KIR gene 2DL1kirLocusType3valid values: "POS" = presence of KIR gene; "NEG" = absence of KIR gene

2.5 idmType elements

There are many infectious disease markers (IDM) possible in the element IDM. Many IDM elements can have either the values idmValueType or idmValueExtType

idmValueType includes the following values: "P","N"

idemValueExtType include the following values: “P”,“G”,“M”,“B”,“H”,“O”,“N”

Field Identifier

Required

Description

Type

Length

Comment

CMVNoCMV statusidmValueExtType1

idmValueExtType: “P”,“G”,“M”,“B”,“H”,“O”,“N”

EMDIS data dictionary also has a ‘Q’ (questionable / unclear) but that will not be applicable within the BMDW data submission file.

CMV_NATNoCMV NAT statusidmValueType1idmValueType: "P","N"
CMV_DATENoDate of CMV testbareDateTyp10Date without timezone information, example 1968-06-28, Date Delimiter = "-"
HBS_AGNoHepatitis B status (hepatitis B surface antigen)idmValueType1idmValueType: "P","N"
ANTI_HBCNoHepatitis B status (antibody to hepatitis B core antigen)idmValueType1idmValueType: "P","N"
ANTI_HBSNoHepatitis B status (antibody to hepatitis B surface antigen)idmValueType1idmValueType: "P","N"
ANTI_HCVNoHepatitis C status (antibody to hepatitis C virus)idmValueType1idmValueType: "P","N"
ANTI_HIV_12NoAnti-HIV 1/2 statusidmValueType1idmValueType: "P","N"
HIV_1_NATNoHIV-1 NAT statusidmValueType1idmValueType: "P","N"
HIV_P24NoHIV p24 statusidmValueType1idmValueType: "P","N"
HCV_NATNoHCV NAT statusidmValueType1idmValueType: "P","N"
ANTI_HTLVNoAntibody to HTLV I/IIidmValueType1idmValueType: "P","N"
SYPHILISNoSyphilis statusidmValueType1idmValueType: "P","N"
WNVNoWNV statusidmValueType1idmValueType: "P","N"
CHAGASNoChagas statusidmValueType1idmValueType: "P","N"
EBVNoEBV statusidmValueExtType1

idmValueExtType: “P”,“G”,“M”,“B”,“H”,“O”,“N”

EMDIS data dictionary also has a ‘Q’ (questionable / unclear) but that will not be applicable within the BMDW data submission file. Please leave blank for Q.

TOXONoToxoplasmosis statusidmValueExtType1

idmValueExtType: “P”,“G”,“M”,“B”,“H”,“O”,“N”

EMDIS data dictionary also has a ‘Q’ (questionable / unclear) but that will not be applicable within the BMDW data submission file. Please leave blank for Q.

HBV_NATNoHBV NAT statusidmValueType1idmValueType: "P","N"
PB19_NATNoParvoB19 NAT statusidmValueType1idmValueType: "P","N"
ALTNoAlanine aminotransferase status in units per litreShort Number, no decimals, minimal value is 1

2.6

 

 

 

 

 

XML example files

Below you can find two XML example files: one for donors and 1 for CBUs.

Both files contain only 2 records, but in those two records almost all possible elements contain a value. It can help you to check the order of the elements in your own XML file. Please be aware that values like GRID are fictive and do not follow the rules for the check character.

 

 

 

 

  • No labels