masskit package

Subpackages

Submodules

masskit.data module

class masskit.data.Alphabet(chars, encoding=None, mask=False, missing=255)

Bases: object

decode(x)

decode index array, x, to byte string of this alphabet

encode(x, count=-1)

encode a string into alphabet indices

get_kmer(h, k)

retrieve byte string of length k decoded from integer h

unpack(h, k)

unpack integer h into array of this alphabet with length k

class masskit.data.MSDictionary

Bases: object

create_df()
create_dictionary(fill_value=None)

create a dictionary array

Parameters:

fill_value – pandas doesn’t allow None as a category value, so use the fill value

create_id2row()

create a dictionary that maps the unique id field to a row number

class masskit.data.SDM12(mask=False)

Bases: Alphabet

A D KER N TSQ YF LIVM C W H G P See https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2732308/#B33 “Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment” Peterson et al. 2009. Bioinformatics.

class masskit.data.Uniprot21(mask=False)

Bases: Alphabet

Module contents