1. Repeated DNA Sequences

 The DNA sequence is composed of a series of nucleotides abbreviated as 'A''C''G', and 'T'


  • For example, "ACGAATTCCG" is a DNA sequence.

When studying DNA, it is useful to identify repeated sequences within the DNA.

Given a string s that represents a DNA sequence, return all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule. You may return the answer in any order.

 

Example 1:

Input: s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT"
Output: ["AAAAACCCCC","CCCCCAAAAA"]

Example 2:

Input: s = "AAAAAAAAAAAAA"
Output: ["AAAAAAAAAA"]

Solution Overview

  1. Fixed Sliding Window: You iterate through the DNA string with a window of fixed size . This ensures you examine every possible 10-letter substring.

  2. Hash Map for Counting: A hash map (or dictionary) is used to keep track of the frequency of each 10-letter substring encountered.

  3. Result Collection: A list or set is used to store the repeated sequences.

solution overview


code


Comments

Popular posts from this blog

Top K Elements

1. Valid palindrome