samples().get_list()¶
Given a list of sample barcodes (of length 16, eg TARGET-B9-7268-01A), this endpoint returns all available “biospecimen” information about this sample, the associated case barcode, a list of associated aliquots, and a list of “data_details” blocks describing each of the data files associated with this sample.
Example:
curl --data '{"sample_barcodes": ["TARGET-10-DCC001-03A","TARGET-52-PASGGN-01A"]}' https://api-dot-isb-cgc.appspot.com/_ah/api/isb_cgc_target_api/v3/target/samples
API explorer example:
Click here to see this endpoint in Google’s API explorer.
Python API Client Example:
from googleapiclient.discovery import build
import httplib2
def get_unauthorized_service():
api = 'isb_cgc_target_api'
version = 'v3'
site = 'https://api-dot-isb-cgc.appspot.com'
discovery_url = '%s/_ah/api/discovery/v1/apis/%s/%s/rest' % (site, api, version)
return build(api, version, discoveryServiceUrl=discovery_url, http=httplib2.Http())
service = get_unauthorized_service()
data = service.samples().get(body={"case_barcodes": ["TARGET-52-PASGGN-01A","TARGET-10-DCC001-03A"]}).execute()
Request
HTTP request:
POST https://api-dot-isb-cgc.appspot.com/_ah/api/isb_cgc_target_api/v3/target/samples
- {
- “sample_barcodes”: [
- “TARGET-10-DCC001-03A”, “TARGET-52-PASGGN-01A”
]
}
Parameters
| Parameter name | Value | Description | ||
|---|---|---|---|---|
|
10 | 50 | ||
| analysis_workflow_type | string | Optional. | ||
| data_category | string | Optional. | ||
| data_format | string | Optional. | ||
| data_type | string | Optional. | ||
| endpoint_type | string | Optional. | ||
| experimental_strategy | string | Optional. | ||
| platform | string | Optional. | ||
| sample_barcode | string | Required. |
Response
If successful, this method returns a response body with the following structure:
{
"aliquots": [string],
"biospecimen_data": {
"age_at_diagnosis": integer,
"case_barcode": string,
"case_gdc_id": string,
"days_to_birth": integer,
"days_to_death": integer,
"days_to_last_followup": integer,
"days_to_last_known_alive": integer,
"disease_code": string,
"endpoint_type": string,
"ethnicity": string,
"event_free_survival": integer,
"first_event": string,
"gender": string,
"program_name": string,
"project_short_name": string,
"protocol": string,
"race": string,
"sample_barcode": string,
"sample_gdc_id": string,
"sample_type": string,
"summary_file_count": integer,
"tumor_code": string,
"vital_status": string,
"wbc_at_diagnosis": number,
"year_of_diagnosis": integer,
"year_of_last_follow_up": integer
},
"case_barcode": string,
"data_details": [
{
"access": string,
"analysis_workflow_type": string,
"data_category": string,
"data_format": string,
"data_type": string,
"disease_code": string,
"endpoint_type": string,
"experimental_strategy": string,
"file_gdc_id": string,
"file_name": string,
"file_name_key": string,
"file_size": string,
"index_file_name": string,
"platform": string,
"program_name": string,
"project_short_name": string,
"sample_barcode": string,
"sample_gdc_id": string,
"sample_type": string
}
],
"data_details_count": integer,
"sample_barcode": string
}
| Parameter name | Value | Description | ||
|---|---|---|---|---|
|
10 | 50 | ||
| aliquots[] | list | List of barcodes of aliquots taken from this participant. | ||
| biospecimen_data | nested object | Biospecimen data about the sample. | ||
| biospecimen_data.age_at_diagnosis | integer | Age at which a condition or disease was first diagnosed in years. | ||
| biospecimen_data.case_barcode | string | Case barcode. | ||
| biospecimen_data.case_gdc_id | string | The GDC assigned id for the case | ||
| biospecimen_data.days_to_birth | integer | Time interval from a person’s date of birth to the date of initial pathologic diagnosis, represented as a calculated number of days. | ||
| biospecimen_data.days_to_death | integer | Time interval from a person’s date of death to the date of initial pathologic diagnosis, represented as a calculated number of days. | ||
| biospecimen_data.days_to_last_followup | integer | Time interval from the date of last followup to the date of initial pathologic diagnosis, represented as a calculated number of days. | ||
| biospecimen_data.days_to_last_known_alive | integer | The number of days between diagnosis and when the individual was last known to be alive. | ||
| biospecimen_data.disease_code | string | The short name for the type of disease | ||
| biospecimen_data.endpoint_type | string | Which type of GDC Case API was used, either legacy or current | ||
| biospecimen_data.ethnicity | string | The text for reporting information about ethnicity based on the Office of Management and Budget (OMB) categories. | ||
| biospecimen_data.event_free_survival | integer | The length of time after primary treatment for a cancer ends that the patient remains free of certain complications or events. | ||
| biospecimen_data.first_event | string | The first event after the diagnosis of cancer. | ||
| biospecimen_data.gender | string | Text designations that identify gender. | ||
| biospecimen_data.program_name | string | Project name, e.g. ‘TCGA’. | ||
| biospecimen_data.project_short_name | string | Tumor type abbreviation, e.g. ‘BRCA’. | ||
| biospecimen_data.protocol | string | A list detailed plans of scientific or medical experiments, treatments, or procedures. | ||
| biospecimen_data.race | string | The text for reporting information about race based on the Office of Management and Budget (OMB) categories. | ||
| biospecimen_data.sample_barcode | string | The barcode assigned by TCGA to a sample from a Participant. | ||
| biospecimen_data.sample_gdc_id | string | The GDC assigned id for the sample | ||
| biospecimen_data.sample_type | string | The type of the sample tumor or normal tissue cell or blood sample provided by a participant. | ||
| biospecimen_data.summary_file_count | integer | The count of files associated with the sample | ||
| biospecimen_data.tumor_code | string | Code representing the type of tumor. | ||
| biospecimen_data.vital_status | string | The survival state of the person registered on the protocol. | ||
| biospecimen_data.wbc_at_diagnosis | number | White blood cell range at diagnosis | ||
| biospecimen_data.year_of_diagnosis | integer | Numeric value to represent the year of an individual’s initial pathologic diagnosis of cancer. | ||
| biospecimen_data.year_of_last_follow_up | integer | Numeric value to represent the year of an individual’s last follow up. | ||
| case | string | Case barcode. | ||
| data_details[] | list | List of information about each file associated with the sample barcode. | ||
| data_details[].access | string | An indication of the security protocol necessary to fulfill in order to access the data from the file, e.g. open, controlled. | ||
| data_details[].analysis_workflow_type | string | The type of workflow used to generate the data file, e.g. ‘BWA-aln’, ‘STAR 2-Pass’, ‘BWA with Mark Duplicates and Cocleaning’ | ||
| data_details[].data_category | string | The higher level categorization of the data_type in the file, e.g. ‘Biospecimen’, ‘Clinical’, ‘Raw sequencing data’, ‘Simple nucleotide variation’ | ||
| data_details[].data_format | string | The format of the data file, e.g. ‘BAM’, ‘BCR XML’, ‘TXT’ | ||
| data_details[].data_type | string | Data type stored in Google Cloud Storage, e.g. ‘Clinical Supplement’, ‘Biospecimen Supplement’, ‘Aligned reads’, ‘Genotypes’, ‘Diagnostic image’ | ||
| data_details[].disease_code | string | The disease abbeviation, e.g. ‘ACC’, ‘UVM’, ‘ALL’, ‘WT’ | ||
| data_details[].endpoint_type | string | The GDC files API the data file information was gottern from, e.g. ‘legacy’, ‘current’ | ||
| data_details[].experimental_strategy | string | The sequencing, array or other strategy used to generate the data file, e.g. ‘RNA-Seq’, ‘WGS’, ‘Genotyping array’ | ||
| data_details[].file_gdc_id | string | The GDC assigned id for the file | ||
| data_details[].file_name | string | Name of the datafile stored on the GDC system. | ||
| data_details[].file_name_key | string | Google Cloud Storage path to file. | ||
| data_details[].file_size | string | The size of the file | ||
| data_details[].index_file_name | string | For BAM files, the name of its index file | ||
| data_details[].platform | string | The sequencing or array platform used, e.g. Illumina HiSeq, Ion Torrent PGM, Affymetrix SNP Array 6.0. | ||
| data_details[].program_name | string | The program for which the data was generated, e.g. ‘CCLE’, ‘TARGET’,’TCGA’. | ||
| data_details[].project_short_name | string | The id of the project, e.g. ‘CCLE-ACC’, ‘CCLE-UVM’, ‘TARGET-ALL-P1’, ‘ TARGET-WT’, ‘TCGA-ACC’, ‘TCGA-UVM’ | ||
| data_details[].sample_barcode | string | Sample barcode. | ||
| data_details[].sample_gdc_id | string | The GDC assigned id for the sample | ||
| data_details[].sample_type | string | The sample type, e.g. ‘01’, ‘10’, ‘11’ | ||
| data_details_count | integer | Number of files associated with the sample barcode. |