TY - JOUR AU - Prasad, Vinod PY - 2021/04/15 Y2 - 2024/03/28 TI - A Character Frequency based Approach to Search for Substrings of a Circular Pattern and its Conjugates in an Online Text JF - Computer Science JA - csci VL - 22 IS - 2 SE - Articles DO - 10.7494/csci.2021.22.2.3401 UR - https://journals.agh.edu.pl/csci/article/view/3401 SP - AB - <p>A fundamental problem in computational biology is to deal with circular patterns. The problem consists of finding the least certain length substrings of a pattern and its rotations in the database. In this paper, a novel method is presented to deal with circular patterns. The problem is solved using two incremental steps. First, an algorithm is provided that reports all substrings of a given linear pattern in an online text. Next, without losing efficiency, the algorithm is extended to process all circular rotations of the pattern. For a given pattern <em>P</em> of size <em>M</em>, and a text <em>T</em> of size <em>N</em>, the algorithm reports all locations in the text where a substring of <em>P<sub>c</sub></em> is found, where <em>P<sub>c</sub></em> is one of the rotations of <em>P</em>. For an alphabet size <em>σ</em>, using <em>O(M)</em> space, desired goals are achieved in an average <em>O(MN/σ)</em> time, which is <em>O(N)</em> for all patterns of length <em>M ≤ σ</em>. Traditional string processing algorithms make use of advanced data structures such as suffix trees and automaton. We show that basic data structures such as arrays can be used in the text processing algorithms without compromising the efficiency.</p> ER -