Chemistry Informatics

Software to draw chemical structures

Plugins:

  • OOChemistry – JChemPaint plugin for OpenOffice, which allows to embed chemical molecules into ODF.
  • Chem4Word – Chemistry Add-in for Microsoft Word, which allows to tag chemical entities and change the representation (2D, common name, formular, …).

Cheminformatics toolkits

    • Facile management of molecules, atoms, bonds, and conformers
    • Conformational and frame-of-reference coordinate transformations
    • Maximum common substructure and exact substructure searching
    • Extremely fast 2D similarity using LINGOS
    • Perception of aromaticity with multiple models
    • Chemical reaction parsing and processing
    • Tetrahedral and E/Z stereochemistry recognition
    • Ring perception and Kekulization
    • Molecular normalization and canonicalization
    • Multiconformer molecule handling
    • Support for residues and bases
    • Ability to store and recall generic primitives or user-defined objects on molecules, atoms, bonds, or conformers
    • Multiple file format handling: robust reading and specification-compliant writing of: SMILES, SLN, SDF, MOL, MOL2, PDB, FASTA, MOPAC, MacroModel, XYZ, CCP4, XPLOR, and OEBinary.
    • Available in C++, Python & Java
    • Graph based structure to modify molecular structures
    • Classes for getting the aromatic flags for atoms and bonds
    • Classes for getting the hybridisation of atoms
    • Descriptor calculation classes (I-State, E-State, Burden, …)
    • SMiles ARbitrary Target Specification (SMARTS) substructure search
    • Base classes for reading and writing molecular file formats
      • Support for SMILES, Chemical Markup Language (CML), CACTVS's clear text format (CTX), POVRay export (including aromatic rings)
    • Atom and bond properties classes (including import and export filter)
    • Processes / External processes and process decision filters
    • Regression module using Neural Networks (JavaNNS)
    • Regression module using Support Vector Machines
    • JOELib-Matlab connection, e.g. for feature extraction
    • External processing modules for 3D structure generation with Corina and descriptor calculation with Petra (especially atom and bond property descriptors).
    • Database module checking for duplicate molecules
    • Available in Java
  • CDK:
    • 2D rendering (see also Renderer Tutorial, Chemblaics Blog by Egon Willighagen and this plus this maillist posts for more examples)
      • JChemPaint 2D diagram editor
      • Structure Diagram Layout
    • 3D Rendering
      • integration with Jmol
    • Input/Output
      • CML, SMILES parsing/generation, MDL Molfile support (limited), InChI (via JNI bridge), readers for XYZ, ShelX, HIN, GhemicalMM, Mol2
      • interface to OpenBabel (via command line)
      • rule based IUPAC name parser
    • Virtual Screening
      • molecular, atomic and bond descriptors
        • LogP, TPSA, Rule-of-Five, many more
        • Gasteiger-Marsili charges (sigma *and* pi)
      • interface to R and Weka for modelling
      • path-based Fingerprinter
    • Modelling
      • 3D model builder
      • atom typing
      • MMFF94 force field
      • Kabsch alignment
    • Chemical Graphs
      • isomorphism detection
      • maximal common substructure search
      • substructure searching (SMARTS like)
      • ring searches (Smallest Set of Smallest Rings (SSSR), all rings)
    • Properties
      • NMR prediction
    • Structure Generation
      • deterministic generator
      • stochastic generators (genetic algorithm- and simulated annealing- based)
    • BioJava interface
    • Protein Structures
      • PDB reading
      • active site detection
      • sequence to connectivity table

Bio Events

Learning material

Blogs and forums

Search engines and databases

Text extraction and analysis

Other

  • Which Genes Can Be Patented? (isolated DNA could not be patented, but that synthetic DNA created in the laboratory — complementary DNA, or cDNA — could be protected under the patent laws)
  • Bio-Linux for bioinformatics workstations
  • Bioclipse – a Rich Client for the Life Sciences
    • Features:
      • The management, analysis, and visualization of chemical structures and related information
      • The management and analysis of biological sequences (DNA, RNA, and proteins).
      • Pharmacological research and drug discovery.
      • Data analysis engine based on the statistical language R.
programming/chemistry.txt · Last modified: 2015/03/10 10:11 by dmitry
 
 
Recent changes RSS feed Driven by DokuWiki