masskit package

Subpackages

Submodules

masskit.data module

class masskit.data.Alphabet(chars, encoding=None, mask=False, missing=255)

Bases: object

decode(x): Decode the index array x into a byte string of this alphabet.

encode(x, count=-1): Encode a string into an array of alphabet indices.

get_kmer(h, k): Return the byte string of length k decoded from the integer h.

unpack(h, k): Unpack the integer h into an array of length k over this alphabet.
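The following is a minimal, self-contained sketch of the ideas behind these four methods. It does not import masskit; `ToyAlphabet` is a hypothetical stand-in written in pure Python, and details such as the digit order in `unpack` and the omission of the `count`, `encoding`, and `mask` arguments are assumptions made for illustration only.

```python
class ToyAlphabet:
    """Illustrative stand-in, not masskit.data.Alphabet itself."""

    def __init__(self, chars: bytes, missing: int = 255):
        self.chars = chars
        self.size = len(chars)
        # Lookup table from byte value to alphabet index.
        # Bytes outside the alphabet map to `missing`.
        self.table = [missing] * 256
        for index, byte in enumerate(chars):
            self.table[byte] = index

    def encode(self, x: bytes) -> list:
        # Byte string -> list of alphabet indices.
        return [self.table[b] for b in x]

    def decode(self, indices) -> bytes:
        # List of alphabet indices -> byte string.
        return bytes(self.chars[i] for i in indices)

    def unpack(self, h: int, k: int) -> list:
        # Assumption: h is read as a k-digit number in base `size`,
        # most significant digit first.
        digits = [0] * k
        for pos in reversed(range(k)):
            h, digits[pos] = divmod(h, self.size)
        return digits

    def get_kmer(self, h: int, k: int) -> bytes:
        # Unpack the integer, then decode the digits as characters.
        return self.decode(self.unpack(h, k))


dna = ToyAlphabet(b"ACGT")
indices = dna.encode(b"GATTACA")   # [2, 0, 3, 3, 0, 1, 0]
roundtrip = dna.decode(indices)    # b"GATTACA"
kmer = dna.get_kmer(27, 3)         # 27 = 1*16 + 2*4 + 3 -> b"CGT"
unknown = dna.encode(b"N")         # [255], the `missing` value
```

Packing a k-mer into a single integer this way gives each of the `size ** k` possible k-mers a distinct value, which is convenient for hashing or counting.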

class masskit.data.MSDictionary

Bases: object

create_df()

create_dictionary(fill_value=None)

Create a dictionary array.

Parameters: fill_value – value to use in place of None, because pandas does not allow None as a category value.

create_id2row(): Create a dictionary that maps the unique id field to a row number.
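As a rough illustration of what an id-to-row mapping looks like, the sketch below builds one from a plain list of ids. The function `build_id2row` is hypothetical and is not part of masskit; in particular, raising an error on duplicate ids is an assumption, since the documentation does not say how duplicates are handled.

```python
def build_id2row(ids):
    """Map each unique id to the row number where it appears."""
    id2row = {}
    for row, record_id in enumerate(ids):
        if record_id in id2row:
            # Assumption: ids are expected to be unique.
            raise ValueError(f"duplicate id {record_id!r} at row {row}")
        id2row[record_id] = row
    return id2row


record_ids = [1001, 1007, 1003]
id2row = build_id2row(record_ids)
row_of_1007 = id2row[1007]  # row number of the record with id 1007
```

A mapping like this turns a lookup by id into a constant-time dictionary access instead of a scan over the rows.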

class masskit.data.SDM12(mask=False)

Bases: Alphabet

A reduced 12-letter amino acid alphabet with the groups A, D, KER, N, TSQ, YF, LIVM, C, W, H, G, P. See Peterson et al. 2009, “Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment”, Bioinformatics: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2732308/#B33
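To make the grouping concrete, the sketch below maps each of the 20 standard amino acids to the index of its group, using the group order listed above. This is a plain-Python illustration rather than masskit code: the names `SDM12_GROUPS`, `RESIDUE_TO_GROUP`, and `reduce_sequence` are hypothetical, and the actual index assigned to each group inside `SDM12` is an assumption.

```python
# The 12 groups as listed in the class description.
SDM12_GROUPS = ["A", "D", "KER", "N", "TSQ", "YF",
                "LIVM", "C", "W", "H", "G", "P"]

# Assumption: each group takes its index from its position in the list.
RESIDUE_TO_GROUP = {
    residue: index
    for index, group in enumerate(SDM12_GROUPS)
    for residue in group
}


def reduce_sequence(sequence, missing=255):
    """Replace each residue with its group index; unknown residues map to `missing`."""
    return [RESIDUE_TO_GROUP.get(residue, missing) for residue in sequence]


reduced = reduce_sequence("MKTAYIAK")  # [6, 2, 4, 0, 5, 6, 0, 2]
```

Residues within a group, such as K, E, and R, become indistinguishable after reduction, which is the point of a reduced alphabet.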

class masskit.data.Uniprot21(mask=False)

Bases: Alphabet
