samples().get_list()

Given a list of sample barcodes (e.g. TARGET-10-DCC001-03A), this endpoint returns, for each sample, all available “biospecimen” information, the associated case barcode, a list of associated aliquots, and a list of “data_details” blocks describing each of the data files associated with the sample.

Example:

curl --data '{"sample_barcodes": ["TARGET-10-DCC001-03A","TARGET-52-PASGGN-01A"]}' https://api-dot-isb-cgc.appspot.com/_ah/api/isb_cgc_target_api/v3/target/samples

API explorer example:

This endpoint can also be tried out in Google’s API Explorer.

Python API Client Example:

from googleapiclient.discovery import build
import httplib2

def get_unauthorized_service():
    # Build a client from the API's discovery document; no credentials are required.
    api = 'isb_cgc_target_api'
    version = 'v3'
    site = 'https://api-dot-isb-cgc.appspot.com'
    discovery_url = '%s/_ah/api/discovery/v1/apis/%s/%s/rest' % (site, api, version)
    return build(api, version, discoveryServiceUrl=discovery_url, http=httplib2.Http())

service = get_unauthorized_service()
# Call get_list() with a "sample_barcodes" list, matching the request body of this endpoint.
data = service.samples().get_list(body={"sample_barcodes": ["TARGET-52-PASGGN-01A", "TARGET-10-DCC001-03A"]}).execute()

Request

HTTP request:

POST https://api-dot-isb-cgc.appspot.com/_ah/api/isb_cgc_target_api/v3/target/samples

{
  "sample_barcodes": ["TARGET-10-DCC001-03A", "TARGET-52-PASGGN-01A"]
}
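
The same request can be assembled with only the Python standard library. The following is a minimal sketch rather than part of the official client: the helper name `build_samples_request` is hypothetical, the JSON `Content-Type` header is an assumption, and the URL and body are taken from the HTTP request shown above. The request is only constructed here; passing it to `urllib.request.urlopen` would send it.

```python
import json
import urllib.request

# URL copied from the HTTP request above.
SAMPLES_URL = (
    "https://api-dot-isb-cgc.appspot.com"
    "/_ah/api/isb_cgc_target_api/v3/target/samples"
)


def build_samples_request(sample_barcodes):
    """Build (but do not send) a POST request for a list of sample barcodes.

    Hypothetical helper for illustration only.
    """
    if not sample_barcodes:
        raise ValueError("at least one sample barcode is required")
    body = json.dumps({"sample_barcodes": list(sample_barcodes)}).encode("utf-8")
    return urllib.request.Request(
        SAMPLES_URL,
        data=body,
        # Assumption: the endpoint expects a JSON content type.
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_samples_request(["TARGET-10-DCC001-03A", "TARGET-52-PASGGN-01A"])
```

Keeping construction separate from sending makes the payload easy to inspect before any network call is made.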

Parameters

Parameter name Value Description

analysis_workflow_type string Optional.

data_category string Optional.

data_format string Optional.

data_type string Optional.

endpoint_type string Optional.

experimental_strategy string Optional.

platform string Optional.

sample_barcode string Required.
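
The optional parameters above share their names with fields of the “data_details” blocks in the response. As a rough sketch, the helper below checks parameter names against the documented list before they are used. The name `build_filters` is hypothetical, and encoding the values as a URL query string is an assumption; this page does not state how the optional parameters are transmitted.

```python
from urllib.parse import urlencode

# Optional parameter names copied from the table above.
OPTIONAL_PARAMETERS = frozenset({
    "analysis_workflow_type",
    "data_category",
    "data_format",
    "data_type",
    "endpoint_type",
    "experimental_strategy",
    "platform",
})


def build_filters(**filters):
    """Return the supplied filters, rejecting undocumented names.

    Hypothetical helper; parameters set to None are dropped.
    """
    unknown = sorted(set(filters) - OPTIONAL_PARAMETERS)
    if unknown:
        raise ValueError("unknown parameter(s): " + ", ".join(unknown))
    return {name: value for name, value in filters.items() if value is not None}


filters = build_filters(data_format="BAM", platform=None)
# Assumption: the filters travel in the query string.
query_string = urlencode(sorted(filters.items()))
```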

Response

If successful, this method returns a response body with the following structure:

{
  "aliquots": [string],
  "biospecimen_data": {
    "age_at_diagnosis": integer,
    "case_barcode": string,
    "case_gdc_id": string,
    "days_to_birth": integer,
    "days_to_death": integer,
    "days_to_last_followup": integer,
    "days_to_last_known_alive": integer,
    "disease_code": string,
    "endpoint_type": string,
    "ethnicity": string,
    "event_free_survival": integer,
    "first_event": string,
    "gender": string,
    "program_name": string,
    "project_short_name": string,
    "protocol": string,
    "race": string,
    "sample_barcode": string,
    "sample_gdc_id": string,
    "sample_type": string,
    "summary_file_count": integer,
    "tumor_code": string,
    "vital_status": string,
    "wbc_at_diagnosis": number,
    "year_of_diagnosis": integer,
    "year_of_last_follow_up": integer
  },
  "case_barcode": string,
  "data_details": [
    {
      "access": string,
      "analysis_workflow_type": string,
      "data_category": string,
      "data_format": string,
      "data_type": string,
      "disease_code": string,
      "endpoint_type": string,
      "experimental_strategy": string,
      "file_gdc_id": string,
      "file_name": string,
      "file_name_key": string,
      "file_size": string,
      "index_file_name": string,
      "platform": string,
      "program_name": string,
      "project_short_name": string,
      "sample_barcode": string,
      "sample_gdc_id": string,
      "sample_type": string
    }
  ],
  "data_details_count": integer,
  "sample_barcode": string
}
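
To illustrate how this structure might be consumed, here is a minimal sketch that reduces one sample record to a few headline values. The function name `summarize_sample` and the placeholder record are hypothetical (the record is not real API output), and the code assumes it is handed a single object with the structure shown above.

```python
def summarize_sample(record):
    """Summarize one sample record of the structure shown above.

    Missing keys are tolerated, since the page does not say which
    fields are always present (an assumption of this sketch).
    """
    details = record.get("data_details", [])
    return {
        "sample_barcode": record.get("sample_barcode"),
        "case_barcode": record.get("case_barcode"),
        "aliquot_count": len(record.get("aliquots", [])),
        "file_count": len(details),
        # True when the reported count matches the number of blocks received.
        "count_matches": record.get("data_details_count") == len(details),
        "data_formats": sorted(
            {d["data_format"] for d in details if d.get("data_format")}
        ),
    }


# Placeholder record with invented values, for illustration only.
example_record = {
    "aliquots": ["ALIQUOT-1", "ALIQUOT-2"],
    "biospecimen_data": {"sample_type": "01"},
    "case_barcode": "CASE-1",
    "data_details": [
        {"data_format": "BAM", "file_name": "a.bam"},
        {"data_format": "TXT", "file_name": "b.txt"},
    ],
    "data_details_count": 2,
    "sample_barcode": "SAMPLE-1",
}

summary = summarize_sample(example_record)
```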

Parameter name Value Description

aliquots[] list List of barcodes of aliquots taken from this participant.

biospecimen_data nested object Biospecimen data about the sample.

biospecimen_data.age_at_diagnosis integer Age at which a condition or disease was first diagnosed in years.

biospecimen_data.case_barcode string Case barcode.

biospecimen_data.case_gdc_id string The GDC assigned id for the case

biospecimen_data.days_to_birth integer Time interval from a person’s date of birth to the date of initial pathologic diagnosis, represented as a calculated number of days.

biospecimen_data.days_to_death integer Time interval from a person’s date of death to the date of initial pathologic diagnosis, represented as a calculated number of days.

biospecimen_data.days_to_last_followup integer Time interval from the date of last followup to the date of initial pathologic diagnosis, represented as a calculated number of days.

biospecimen_data.days_to_last_known_alive integer The number of days between diagnosis and when the individual was last known to be alive.

biospecimen_data.disease_code string The short name for the type of disease

biospecimen_data.endpoint_type string Which type of GDC Case API was used, either legacy or current

biospecimen_data.ethnicity string The text for reporting information about ethnicity based on the Office of Management and Budget (OMB) categories.

biospecimen_data.event_free_survival integer The length of time after primary treatment for a cancer ends that the patient remains free of certain complications or events.

biospecimen_data.first_event string The first event after the diagnosis of cancer.

biospecimen_data.gender string Text designations that identify gender.

biospecimen_data.program_name string Project name, e.g. ‘TCGA’.

biospecimen_data.project_short_name string Tumor type abbreviation, e.g. ‘BRCA’.

biospecimen_data.protocol string A list of detailed plans of scientific or medical experiments, treatments, or procedures.

biospecimen_data.race string The text for reporting information about race based on the Office of Management and Budget (OMB) categories.

biospecimen_data.sample_barcode string The barcode assigned by TCGA to a sample from a Participant.

biospecimen_data.sample_gdc_id string The GDC assigned id for the sample

biospecimen_data.sample_type string The type of sample (tumor or normal tissue, cell, or blood) provided by a participant.

biospecimen_data.summary_file_count integer The count of files associated with the sample

biospecimen_data.tumor_code string Code representing the type of tumor.

biospecimen_data.vital_status string The survival state of the person registered on the protocol.

biospecimen_data.wbc_at_diagnosis number White blood cell range at diagnosis

biospecimen_data.year_of_diagnosis integer Numeric value to represent the year of an individual’s initial pathologic diagnosis of cancer.

biospecimen_data.year_of_last_follow_up integer Numeric value to represent the year of an individual’s last follow up.

case_barcode string Case barcode.

data_details[] list List of information about each file associated with the sample barcode.

data_details[].access string An indication of the security protocol necessary to fulfill in order to access the data from the file, e.g. open, controlled.

data_details[].analysis_workflow_type string The type of workflow used to generate the data file, e.g. ‘BWA-aln’, ‘STAR 2-Pass’, ‘BWA with Mark Duplicates and Cocleaning’

data_details[].data_category string The higher level categorization of the data_type in the file, e.g. ‘Biospecimen’, ‘Clinical’, ‘Raw sequencing data’, ‘Simple nucleotide variation’

data_details[].data_format string The format of the data file, e.g. ‘BAM’, ‘BCR XML’, ‘TXT’

data_details[].data_type string Data type stored in Google Cloud Storage, e.g. ‘Clinical Supplement’, ‘Biospecimen Supplement’, ‘Aligned reads’, ‘Genotypes’, ‘Diagnostic image’

data_details[].disease_code string The disease abbreviation, e.g. ‘ACC’, ‘UVM’, ‘ALL’, ‘WT’

data_details[].endpoint_type string The GDC files API from which the data file information was obtained, e.g. ‘legacy’, ‘current’

data_details[].experimental_strategy string The sequencing, array or other strategy used to generate the data file, e.g. ‘RNA-Seq’, ‘WGS’, ‘Genotyping array’

data_details[].file_gdc_id string The GDC assigned id for the file

data_details[].file_name string Name of the datafile stored on the GDC system.

data_details[].file_name_key string Google Cloud Storage path to file.

data_details[].file_size string The size of the file

data_details[].index_file_name string For BAM files, the name of its index file

data_details[].platform string The sequencing or array platform used, e.g. Illumina HiSeq, Ion Torrent PGM, Affymetrix SNP Array 6.0.

data_details[].program_name string The program for which the data was generated, e.g. ‘CCLE’, ‘TARGET’, ‘TCGA’.

data_details[].project_short_name string The id of the project, e.g. ‘CCLE-ACC’, ‘CCLE-UVM’, ‘TARGET-ALL-P1’, ‘TARGET-WT’, ‘TCGA-ACC’, ‘TCGA-UVM’

data_details[].sample_barcode string Sample barcode.

data_details[].sample_gdc_id string The GDC assigned id for the sample

data_details[].sample_type string The sample type, e.g. ‘01’, ‘10’, ‘11’

data_details_count integer Number of files associated with the sample barcode.
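
Note that file_size is typed as a string, while access distinguishes open from controlled files. A minimal sketch of post-processing a data_details list along those lines follows. The helper name `open_access_files` and the sample entries are hypothetical, and parsing file_size as a decimal integer is an assumption, since this page does not specify its format or units.

```python
def open_access_files(data_details):
    """Return (paths, total_size) for entries whose access is 'open'.

    Hypothetical helper. Assumes file_size, when present, is a decimal
    integer string; entries with an unparsable size contribute 0.
    """
    paths = []
    total_size = 0
    for detail in data_details:
        if detail.get("access") != "open":
            continue
        path = detail.get("file_name_key")
        if path:
            paths.append(path)
        try:
            total_size += int(detail.get("file_size") or 0)
        except ValueError:
            pass  # size is not numeric; skip it rather than guess
    return paths, total_size


# Invented entries for illustration; not real API output.
example_details = [
    {"access": "open", "file_name_key": "gs://example-bucket/a.txt", "file_size": "120"},
    {"access": "controlled", "file_name_key": "gs://example-bucket/b.bam", "file_size": "5000"},
    {"access": "open", "file_name_key": "gs://example-bucket/c.txt", "file_size": "n/a"},
]

paths, total_size = open_access_files(example_details)
```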

