1. Repeated DNA Sequences
The DNA sequence is composed of a series of nucleotides abbreviated as 'A', 'C', 'G', and 'T'
- For example,
"ACGAATTCCG"is a DNA sequence.
When studying DNA, it is useful to identify repeated sequences within the DNA.
Given a string s that represents a DNA sequence, return all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule. You may return the answer in any order.
Example 1:
Input: s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT" Output: ["AAAAACCCCC","CCCCCAAAAA"]
Example 2:
Input: s = "AAAAAAAAAAAAA"
Output: ["AAAAAAAAAA"]Solution Overview
Fixed Sliding Window: You iterate through the DNA string with a window of fixed size . This ensures you examine every possible 10-letter substring.
Hash Map for Counting: A hash map (or dictionary) is used to keep track of the frequency of each 10-letter substring encountered.
Result Collection: A list or set is used to store the repeated sequences.
| solution overview code |
Comments
Post a Comment