hmm

integron_finder.hmm.read_hmm(replicon_id, prot_db, infile, cfg, evalue=1.0, coverage=0.5)

Parses the hmmer --out output and returns a pandas DataFrame of hits filtered by evalue and coverage (coverage being the proportion of the profile aligned).

Parameters
  • replicon_id (str) – the id of the replicon

  • prot_db (integron_finder.prot_db.ProteinDB object) – the protein database corresponding to the replicon translation

  • infile (str) – the hmm output (in tabulated format) to parse

  • cfg (integron_finder.config.Config object) – the config

  • evalue (float) – filter out hits with an evalue greater than evalue.

  • coverage (float) – filter out hits with a coverage below coverage (the proportion of the profile aligned)

Returns

DataFrame with columns:

“Accession_number”, “query_name”, “ID_query”, “ID_prot”, “strand”, “pos_beg”, “pos_end”, “evalue”.
Each row corresponds to a hit.

Return type

pandas.DataFrame
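
The filtering rule described by the evalue and coverage parameters can be illustrated with a minimal, standalone sketch. It does not call integron_finder and is not the library's implementation: the Hit record, the profile_coverage and filter_hits helpers, and the sample values are all hypothetical, and the coverage formula (aligned profile positions divided by profile length) is an assumption based on the description "proportion of the profile aligned".

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """Hypothetical, simplified hit record (not the library's data model)."""
    id_prot: str
    evalue: float
    hmm_from: int  # first aligned position on the profile (1-based)
    hmm_to: int    # last aligned position on the profile (inclusive)
    hmm_len: int   # total length of the profile


def profile_coverage(hit: Hit) -> float:
    """Fraction of the profile covered by the alignment (assumed definition)."""
    return (hit.hmm_to - hit.hmm_from + 1) / hit.hmm_len


def filter_hits(hits, evalue=1.0, coverage=0.5):
    """Drop hits with an e-value above `evalue` or a coverage below `coverage`."""
    return [h for h in hits
            if h.evalue <= evalue and profile_coverage(h) >= coverage]


# Made-up hits, for illustration only.
hits = [
    Hit("prot_1", 1e-30, 1, 180, 200),   # coverage 0.9, kept
    Hit("prot_2", 1e-10, 50, 120, 200),  # coverage 0.355, dropped (coverage too low)
    Hit("prot_3", 5.0, 1, 200, 200),     # coverage 1.0, dropped (e-value too high)
]
kept = filter_hits(hits)
print([h.id_prot for h in kept])
```

With the default thresholds (evalue=1.0, coverage=0.5), only the first made-up hit passes both filters.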

integron_finder.hmm.scan_hmm_bank(path)
Parameters

path (str) –

  • if the path is a dir: return all files ending with .hmm in the dir

  • if the path is a file: parse the file, each line must be an expression (glob) pointing to hmm files

Returns

list of hmm files to consider for annotation

Return type

list of str

Raises

IOError – if the path does not exist
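
The directory and file cases described above can be sketched with the standard glob module. This is an illustrative re-implementation of the documented behavior, not the library's code: the function name scan_hmm_bank_sketch and the file names are hypothetical, and sorting the results and skipping blank lines are assumptions.

```python
import glob
import os
import tempfile


def scan_hmm_bank_sketch(path):
    """Illustrative sketch of the documented behavior (not the library code)."""
    if os.path.isdir(path):
        # Directory: every file ending with .hmm (sorted for a stable order).
        return sorted(glob.glob(os.path.join(path, "*.hmm")))
    if os.path.isfile(path):
        # File: each line is a glob expression pointing to hmm files.
        files = []
        with open(path) as bank:
            for line in bank:
                pattern = line.strip()
                if pattern:  # skipping blank lines is an assumption
                    files.extend(sorted(glob.glob(pattern)))
        return files
    raise IOError(f"{path} does not exist")


# Demonstration on a throwaway directory with made-up file names.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("int_a.hmm", "int_b.hmm", "other.hmm", "notes.txt"):
        open(os.path.join(tmp, name), "w").close()

    # A bank file holding one glob expression per line.
    bank = os.path.join(tmp, "bank.txt")
    with open(bank, "w") as fh:
        fh.write(os.path.join(tmp, "int_*.hmm") + "\n")

    from_dir = [os.path.basename(f) for f in scan_hmm_bank_sketch(tmp)]
    from_file = [os.path.basename(f) for f in scan_hmm_bank_sketch(bank)]

print(from_dir)
print(from_file)
```

In this sketch the directory form picks up every .hmm file, whereas the bank file narrows the selection to the files matched by its glob expressions.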