1 <html><body> 2 <style> 3 4 body, h1, h2, h3, div, span, p, pre, a { 5 margin: 0; 6 padding: 0; 7 border: 0; 8 font-weight: inherit; 9 font-style: inherit; 10 font-size: 100%; 11 font-family: inherit; 12 vertical-align: baseline; 13 } 14 15 body { 16 font-size: 13px; 17 padding: 1em; 18 } 19 20 h1 { 21 font-size: 26px; 22 margin-bottom: 1em; 23 } 24 25 h2 { 26 font-size: 24px; 27 margin-bottom: 1em; 28 } 29 30 h3 { 31 font-size: 20px; 32 margin-bottom: 1em; 33 margin-top: 1em; 34 } 35 36 pre, code { 37 line-height: 1.5; 38 font-family: Monaco, 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Lucida Console', monospace; 39 } 40 41 pre { 42 margin-top: 0.5em; 43 } 44 45 h1, h2, h3, p { 46 font-family: Arial, sans serif; 47 } 48 49 h1, h2, h3 { 50 border-bottom: solid #CCC 1px; 51 } 52 53 .toc_element { 54 margin-top: 0.5em; 55 } 56 57 .firstline { 58 margin-left: 2 em; 59 } 60 61 .method { 62 margin-top: 1em; 63 border: solid 1px #CCC; 64 padding: 1em; 65 background: #EEE; 66 } 67 68 .details { 69 font-weight: bold; 70 font-size: 14px; 71 } 72 73 </style> 74 75 <h1><a href="genomics_v1.html">Genomics API</a> . <a href="genomics_v1.reads.html">reads</a></h1> 76 <h2>Instance Methods</h2> 77 <p class="toc_element"> 78 <code><a href="#search">search(body, x__xgafv=None)</a></code></p> 79 <p class="firstline">Gets a list of reads for one or more read group sets.</p> 80 <p class="toc_element"> 81 <code><a href="#search_next">search_next(previous_request, previous_response)</a></code></p> 82 <p class="firstline">Retrieves the next page of results.</p> 83 <h3>Method Details</h3> 84 <div class="method"> 85 <code class="details" id="search">search(body, x__xgafv=None)</code> 86 <pre>Gets a list of reads for one or more read group sets. 87 88 For the definitions of read group sets and other genomics resources, see 89 [Fundamentals of Google 90 Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics) 91 92 Reads search operates over a genomic coordinate space of reference sequence 93 & position defined over the reference sequences to which the requested 94 read group sets are aligned. 95 96 If a target positional range is specified, search returns all reads whose 97 alignment to the reference genome overlap the range. A query which 98 specifies only read group set IDs yields all reads in those read group 99 sets, including unmapped reads. 100 101 All reads returned (including reads on subsequent pages) are ordered by 102 genomic coordinate (by reference sequence, then position). Reads with 103 equivalent genomic coordinates are returned in an unspecified order. This 104 order is consistent, such that two queries for the same content (regardless 105 of page size) yield reads in the same order across their respective streams 106 of paginated responses. 107 108 Implements 109 [GlobalAllianceApi.searchReads](https://github.com/ga4gh/schemas/blob/v0.5.1/src/main/resources/avro/readmethods.avdl#L85). 110 111 Args: 112 body: object, The request body. (required) 113 The object takes the form of: 114 115 { # The read search request. 116 "end": "A String", # The end position of the range on the reference, 0-based exclusive. If 117 # specified, `referenceName` must also be specified. 118 "readGroupIds": [ # The IDs of the read groups within which to search for reads. All specified 119 # read groups must belong to the same read group sets. Must specify one of 120 # `readGroupSetIds` or `readGroupIds`. 121 "A String", 122 ], 123 "pageSize": 42, # The maximum number of results to return in a single page. If unspecified, 124 # defaults to 256. The maximum value is 2048. 125 "start": "A String", # The start position of the range on the reference, 0-based inclusive. If 126 # specified, `referenceName` must also be specified. 127 "pageToken": "A String", # The continuation token, which is used to page through large result sets. 128 # To get the next page of results, set this parameter to the value of 129 # `nextPageToken` from the previous response. 130 "referenceName": "A String", # The reference sequence name, for example `chr1`, `1`, or `chrX`. If set to 131 # `*`, only unmapped reads are returned. If unspecified, all reads (mapped 132 # and unmapped) are returned. 133 "readGroupSetIds": [ # The IDs of the read groups sets within which to search for reads. All 134 # specified read group sets must be aligned against a common set of reference 135 # sequences; this defines the genomic coordinates for the query. Must specify 136 # one of `readGroupSetIds` or `readGroupIds`. 137 "A String", 138 ], 139 } 140 141 x__xgafv: string, V1 error format. 142 Allowed values 143 1 - v1 error format 144 2 - v2 error format 145 146 Returns: 147 An object of the form: 148 149 { # The read search response. 150 "nextPageToken": "A String", # The continuation token, which is used to page through large result sets. 151 # Provide this value in a subsequent request to return the next page of 152 # results. This field will be empty if there aren't any additional results. 153 "alignments": [ # The list of matching alignments sorted by mapped genomic coordinate, 154 # if any, ascending in position within the same reference. Unmapped reads, 155 # which have no position, are returned contiguously and are sorted in 156 # ascending lexicographic order by fragment name. 157 { # A read alignment describes a linear alignment of a string of DNA to a 158 # reference sequence, in addition to metadata 159 # about the fragment (the molecule of DNA sequenced) and the read (the bases 160 # which were read by the sequencer). A read is equivalent to a line in a SAM 161 # file. A read belongs to exactly one read group and exactly one 162 # read group set. 163 # 164 # For more genomics resource definitions, see [Fundamentals of Google 165 # Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics) 166 # 167 # ### Reverse-stranded reads 168 # 169 # Mapped reads (reads having a non-null `alignment`) can be aligned to either 170 # the forward or the reverse strand of their associated reference. Strandedness 171 # of a mapped read is encoded by `alignment.position.reverseStrand`. 172 # 173 # If we consider the reference to be a forward-stranded coordinate space of 174 # `[0, reference.length)` with `0` as the left-most position and 175 # `reference.length` as the right-most position, reads are always aligned left 176 # to right. That is, `alignment.position.position` always refers to the 177 # left-most reference coordinate and `alignment.cigar` describes the alignment 178 # of this read to the reference from left to right. All per-base fields such as 179 # `alignedSequence` and `alignedQuality` share this same left-to-right 180 # orientation; this is true of reads which are aligned to either strand. For 181 # reverse-stranded reads, this means that `alignedSequence` is the reverse 182 # complement of the bases that were originally reported by the sequencing 183 # machine. 184 # 185 # ### Generating a reference-aligned sequence string 186 # 187 # When interacting with mapped reads, it's often useful to produce a string 188 # representing the local alignment of the read to reference. The following 189 # pseudocode demonstrates one way of doing this: 190 # 191 # out = "" 192 # offset = 0 193 # for c in read.alignment.cigar { 194 # switch c.operation { 195 # case "ALIGNMENT_MATCH", "SEQUENCE_MATCH", "SEQUENCE_MISMATCH": 196 # out += read.alignedSequence[offset:offset+c.operationLength] 197 # offset += c.operationLength 198 # break 199 # case "CLIP_SOFT", "INSERT": 200 # offset += c.operationLength 201 # break 202 # case "PAD": 203 # out += repeat("*", c.operationLength) 204 # break 205 # case "DELETE": 206 # out += repeat("-", c.operationLength) 207 # break 208 # case "SKIP": 209 # out += repeat(" ", c.operationLength) 210 # break 211 # case "CLIP_HARD": 212 # break 213 # } 214 # } 215 # return out 216 # 217 # ### Converting to SAM's CIGAR string 218 # 219 # The following pseudocode generates a SAM CIGAR string from the 220 # `cigar` field. Note that this is a lossy conversion 221 # (`cigar.referenceSequence` is lost). 222 # 223 # cigarMap = { 224 # "ALIGNMENT_MATCH": "M", 225 # "INSERT": "I", 226 # "DELETE": "D", 227 # "SKIP": "N", 228 # "CLIP_SOFT": "S", 229 # "CLIP_HARD": "H", 230 # "PAD": "P", 231 # "SEQUENCE_MATCH": "=", 232 # "SEQUENCE_MISMATCH": "X", 233 # } 234 # cigarStr = "" 235 # for c in read.alignment.cigar { 236 # cigarStr += c.operationLength + cigarMap[c.operation] 237 # } 238 # return cigarStr 239 "info": { # A map of additional read alignment information. This must be of the form 240 # map<string, string[]> (string key mapping to a list of string values). 241 "a_key": [ 242 "", 243 ], 244 }, 245 "duplicateFragment": True or False, # The fragment is a PCR or optical duplicate (SAM flag 0x400). 246 "readGroupSetId": "A String", # The ID of the read group set this read belongs to. A read belongs to 247 # exactly one read group set. 248 "alignedQuality": [ # The quality of the read sequence contained in this alignment record 249 # (equivalent to QUAL in SAM). 250 # `alignedSequence` and `alignedQuality` may be shorter than the full read 251 # sequence and quality. This will occur if the alignment is part of a 252 # chimeric alignment, or if the read was trimmed. When this occurs, the CIGAR 253 # for this read will begin/end with a hard clip operator that will indicate 254 # the length of the excised sequence. 255 42, 256 ], 257 "failedVendorQualityChecks": True or False, # Whether this read did not pass filters, such as platform or vendor quality 258 # controls (SAM flag 0x200). 259 "fragmentName": "A String", # The fragment name. Equivalent to QNAME (query template name) in SAM. 260 "id": "A String", # The server-generated read ID, unique across all reads. This is different 261 # from the `fragmentName`. 262 "properPlacement": True or False, # The orientation and the distance between reads from the fragment are 263 # consistent with the sequencing protocol (SAM flag 0x2). 264 "readGroupId": "A String", # The ID of the read group this read belongs to. A read belongs to exactly 265 # one read group. This is a server-generated ID which is distinct from SAM's 266 # RG tag (for that value, see 267 # ReadGroup.name). 268 "supplementaryAlignment": True or False, # Whether this alignment is supplementary. Equivalent to SAM flag 0x800. 269 # Supplementary alignments are used in the representation of a chimeric 270 # alignment. In a chimeric alignment, a read is split into multiple 271 # linear alignments that map to different reference contigs. The first 272 # linear alignment in the read will be designated as the representative 273 # alignment; the remaining linear alignments will be designated as 274 # supplementary alignments. These alignments may have different mapping 275 # quality scores. In each linear alignment in a chimeric alignment, the read 276 # will be hard clipped. The `alignedSequence` and 277 # `alignedQuality` fields in the alignment record will only 278 # represent the bases for its respective linear alignment. 279 "numberReads": 42, # The number of reads in the fragment (extension to SAM flag 0x1). 280 "fragmentLength": 42, # The observed length of the fragment, equivalent to TLEN in SAM. 281 "secondaryAlignment": True or False, # Whether this alignment is secondary. Equivalent to SAM flag 0x100. 282 # A secondary alignment represents an alternative to the primary alignment 283 # for this read. Aligners may return secondary alignments if a read can map 284 # ambiguously to multiple coordinates in the genome. By convention, each read 285 # has one and only one alignment where both `secondaryAlignment` 286 # and `supplementaryAlignment` are false. 287 "alignedSequence": "A String", # The bases of the read sequence contained in this alignment record, 288 # **without CIGAR operations applied** (equivalent to SEQ in SAM). 289 # `alignedSequence` and `alignedQuality` may be 290 # shorter than the full read sequence and quality. This will occur if the 291 # alignment is part of a chimeric alignment, or if the read was trimmed. When 292 # this occurs, the CIGAR for this read will begin/end with a hard clip 293 # operator that will indicate the length of the excised sequence. 294 "readNumber": 42, # The read number in sequencing. 0-based and less than numberReads. This 295 # field replaces SAM flag 0x40 and 0x80. 296 "alignment": { # A linear alignment can be represented by one CIGAR string. Describes the # The linear alignment for this alignment record. This field is null for 297 # unmapped reads. 298 # mapped position and local alignment of the read to the reference. 299 "position": { # An abstraction for referring to a genomic position, in relation to some # The position of this alignment. 300 # already known reference. For now, represents a genomic position as a 301 # reference name, a base number on that reference (0-based), and a 302 # determination of forward or reverse strand. 303 "position": "A String", # The 0-based offset from the start of the forward strand for that reference. 304 "reverseStrand": True or False, # Whether this position is on the reverse strand, as opposed to the forward 305 # strand. 306 "referenceName": "A String", # The name of the reference in whatever reference set is being used. 307 }, 308 "cigar": [ # Represents the local alignment of this sequence (alignment matches, indels, 309 # etc) against the reference. 310 { # A single CIGAR operation. 311 "referenceSequence": "A String", # `referenceSequence` is only used at mismatches 312 # (`SEQUENCE_MISMATCH`) and deletions (`DELETE`). 313 # Filling this field replaces SAM's MD tag. If the relevant information is 314 # not available, this field is unset. 315 "operation": "A String", 316 "operationLength": "A String", # The number of genomic bases that the operation runs for. Required. 317 }, 318 ], 319 "mappingQuality": 42, # The mapping quality of this alignment. Represents how likely 320 # the read maps to this position as opposed to other locations. 321 # 322 # Specifically, this is -10 log10 Pr(mapping position is wrong), rounded to 323 # the nearest integer. 324 }, 325 "nextMatePosition": { # An abstraction for referring to a genomic position, in relation to some # The mapping of the primary alignment of the 326 # `(readNumber+1)%numberReads` read in the fragment. It replaces 327 # mate position and mate strand in SAM. 328 # already known reference. For now, represents a genomic position as a 329 # reference name, a base number on that reference (0-based), and a 330 # determination of forward or reverse strand. 331 "position": "A String", # The 0-based offset from the start of the forward strand for that reference. 332 "reverseStrand": True or False, # Whether this position is on the reverse strand, as opposed to the forward 333 # strand. 334 "referenceName": "A String", # The name of the reference in whatever reference set is being used. 335 }, 336 }, 337 ], 338 }</pre> 339 </div> 340 341 <div class="method"> 342 <code class="details" id="search_next">search_next(previous_request, previous_response)</code> 343 <pre>Retrieves the next page of results. 344 345 Args: 346 previous_request: The request for the previous page. (required) 347 previous_response: The response from the request for the previous page. (required) 348 349 Returns: 350 A request object that you can call 'execute()' on to request the next 351 page. Returns None if there are no more items in the collection. 352 </pre> 353 </div> 354 355 </body></html>