marsi.io package

Submodules

marsi.io.bigg module

marsi.io.build_database module

marsi.io.build_database.build_database(data, data_dir, with_zinc=True, session=<sqlalchemy.orm.session.Session object>)[source]

Builds then Molecules database. It requires that the input files have been downloaded.

See also

Initialization()

Parameters:
  • data (module) – The marsi.io.data module
  • data_dir (str) – The path to where data is stored.
marsi.io.build_database.upload_chebi_entries(chebi_structures_file, chebi_data, i=0, session=<sqlalchemy.orm.session.Session object>, keys=None)[source]

Import ChEBI data

marsi.io.build_database.upload_drugbank_entries(drugbank_structures_file, drugbank_data, i=0, session=<sqlalchemy.orm.session.Session object>, keys=None)[source]

Import DrugBank

marsi.io.build_database.upload_kegg_entries(kegg_mol_files_dir, kegg_data, i=0, session=<sqlalchemy.orm.session.Session object>, keys=None)[source]

Import KEGG

marsi.io.build_database.upload_pubchem_entries(pubchem_sdf_files_dir, pubchem_data, i=0, session=<sqlalchemy.orm.session.Session object>, keys=None)[source]

Import PubChem

marsi.io.build_database.upload_zinc_entries(zinc_data_file, i=0, session=<sqlalchemy.orm.session.Session object>, keys=None)[source]

Add ZINC

marsi.io.data module

marsi.io.db module

class marsi.io.db.Reference(**kwargs)[source]

Bases: sqlalchemy.ext.declarative.api.Base

Methods

add_reference(database, accession[, session])
get(database, accession[, session])
to_dict()
id
database
accession
classmethod get(database, accession, session=<sqlalchemy.orm.session.Session object>)[source]
classmethod add_reference(database, accession, session=<sqlalchemy.orm.session.Session object>)[source]
to_dict()[source]
class marsi.io.db.Metabolite(**kwargs)[source]

Bases: sqlalchemy.ext.declarative.api.Base

Attributes

volume

Methods

calc_solubility()
dump()
fingerprint([fingerprint_format])
from_molecule(molecule, references, synonyms)
from_references(references[, session])
get(inchi_key[, session]) Retrieves a metabolite using the InChI Key.
molecule([library, get3d])
restore(dump[, session])
to_dict()
validate_inchi_key(key, inchi_key)
id
inchi_key
inchi
analog
formula
num_atoms
num_bonds
num_rings
sdf
references
synonyms
fingerprints

A descriptor that presents a read/write view of an object attribute.

validate_inchi_key(key, inchi_key)[source]
classmethod get(inchi_key, session=<sqlalchemy.orm.session.Session object>)[source]

Retrieves a metabolite using the InChI Key.

Parameters:
  • inchi_key (str) – A valid InChi Key.
  • session (sqlalchemy.orm.session.Session) – A database session.
Returns:

Return type:

Metabolite

Raises:

KeyError – If the InChI Key is not available.

classmethod from_references(references, session=<sqlalchemy.orm.session.Session object>)[source]
classmethod from_molecule(molecule, references, synonyms, analog=False, session=<sqlalchemy.orm.session.Session object>, first_time=False)[source]
calc_solubility()[source]
volume
molecule(library='openbabel', get3d=True)[source]
fingerprint(fingerprint_format='maccs')[source]
to_dict()[source]
dump()[source]
classmethod restore(dump, session=<sqlalchemy.orm.session.Session object>)[source]
class marsi.io.db.Database[source]

Bases: object

metabolites = <marsi.io.db.CollectionWrapper object>

marsi.io.enrichment module

marsi.io.enrichment.map_uniprot_from_pdb_ids(pdb_ids)[source]
marsi.io.enrichment.find_chebi_id(metabolite)[source]

Queries ChEBI using InChI Key.

Parameters:metabolite (Metabolite) –
Returns:With tuples of (id, name)
Return type:list
marsi.io.enrichment.find_pubchem_id(metabolite)[source]

Queries PubChem Compound using InChI Key.

Parameters:metabolite (Metabolite) –
Returns:With tuples of (id, name)
Return type:list
marsi.io.enrichment.find_best_chebi_structure(entity)[source]

marsi.io.parsers module

marsi.io.parsers.parse_kegg_brite(brite_file)[source]
marsi.io.parsers.parse_chebi_data(chebi_names_file, chebi_vertice_file, chebi_relation_file)[source]
marsi.io.parsers.parse_pubchem(summary_file)[source]

marsi.io.plots module

marsi.io.plots.summary_plot(summary, dbs=['chebi', 'kegg', 'drugbank', 'pubchem'])[source]
marsi.io.plots.venn_plot(summary, dbs=['chebi', 'kegg', 'drugbank', 'pubchem'])[source]

marsi.io.retriaval module

marsi.io.retriaval.retrieve_bigg_reactions(dest='/home/travis/.marsi/data/bigg_models_reactions.txt')[source]

Retrieves bigg reactions file

marsi.io.retriaval.retrieve_bigg_metabolites(dest='/home/travis/.marsi/data/bigg_models_metabolites.txt')[source]

Retrieves bigg metabolites file

marsi.io.retriaval.retrieve_drugbank_open_structures(db_version='5.0.3', dest='/home/travis/.marsi/data/drugbank_open_structures.sdf')[source]

Retrieves Drugbank Open Structures.

Parameters:db_version (str) – The version of drugbank to retrieve
marsi.io.retriaval.retrieve_drugbank_open_vocabulary(db_version='5.0.3', dest='/home/travis/.marsi/data/drugbank_open_vocabulary.csv')[source]

Retrieves Drugbank Open Vocabulary.

Parameters:db_version (str) – The version of drugbank to retrieve
marsi.io.retriaval.retrieve_chebi_structures(dest='/home/travis/.marsi/data/chebi_lite_3star.sdf')[source]

Retrieves ChEBI sdf (lite version).

marsi.io.retriaval.retrieve_chebi_names(dest='/home/travis/.marsi/data/chebi_names_3star.txt')[source]

Retrieves ChEBI names.

marsi.io.retriaval.retrieve_chebi_relation(dest='/home/travis/.marsi/data/chebi_relation_3star.tsv')[source]

Retrieves ChEBI relation data.

marsi.io.retriaval.retrieve_chebi_vertice(dest='/home/travis/.marsi/data/chebi_vertice_3star.tsv')[source]

Retrieves ChEBI vertice data.

marsi.io.retriaval.retrieve_kegg_brite(dest='/home/travis/.marsi/data/kegg_brite_08310.keg')[source]

Retrieves KEGG Brite 08310 (Target-based Classification of Drugs)

marsi.io.retriaval.retrieve_pubchem_mol_files(pubchem_ids, dest='/home/travis/.marsi/data')[source]

Retrieves SDF Files from PubChem.

marsi.io.retriaval.retrieve_kegg_mol_files(kegg, dest='/home/travis/.marsi/data')[source]

Retrieves KEGG MOL Files using KEGG REST API.

marsi.io.retriaval.retrieve_zinc_properties(dest='/home/travis/.marsi/data/zinc_16_prop.tsv')[source]

Retrieves ZINC properties file: “All Clean As Subset #6, but without ‘yuck’ compounds”

marsi.io.retriaval.retrieve_zinc_structures(dest='/home/travis/.marsi/data/zinc_16.sdf.gz')[source]

Retrieves ZINC structures file: “All Clean As Subset #6, but without ‘yuck’ compounds”

Module contents

marsi.io.write_excel_file(data_frame, path, molecules)[source]