Hub
Pricing About
WorkflowWorkflow

SMARTS Matching

RDKitSMARTS
ja42 profile image
Draft Latest edits on 
Feb 8, 2024 8:50 PM
Drag & drop
Like
Download workflow
Workflow preview
Overview: This workflow demonstrates how to perform some SMARTS filtering and highlighting operations using the RDKit community nodes. In particular, 3 different SMARTS operations are shown: 1. View filtered molecules from single SMARTS query (top). This process takes a user-provided SMARTS query and filters the provided SMILES file based on the query. Substructures which match the query are highlighted. 2. View molecules filtered by SMARTS file (middle). This process performs filtering based on the given SMILES file and SMARTS file. The RDKit Molecule Substructure Filter node can be configured to specify how the filtering should be done. 3. Count # of matches from SMARTS file (bottom). This process counts the number of substructure matches found in the given SMILES file, using the SMARTS file as input queries. Each query will be a column in the output table, with column/row entries indicating the number of matches between a given query/molecule (SMILES). Load Data: Example data is provided, although the user may change the path to the SMILES or SMARTS file (in the "Load data" section) to specify the file they'd like to use. The only requirements are that the SMILES table contains a column titled "SMILES" with SMILES entries and column "Name" giving the molecule name or id. Similarly, the SMARTS table should have a column titled "SMARTS" with SMARTS queries and a column "Name" giving the name or id of the query. Note that you may need to adjust the Column Renamer and CSV Reader nodes to account for differences in formatting. Additional Resources: If you'd like to use a command-line interface with similar (and more robust/faster) functionality, please visit RDKit Tools repository linked in external resources. Acknowledgement: Both the SMILES and SMARTS data used in this example workflow are taken from public sources. The SMILES file is based on the Tox21 project, and is taken from the GitHub repository constructed by Guillaume Lambard (see external resources). The SMARTS file is based on PAINS filters (Baell and Holloway) and it taken from the link provided in external resources.

External resources

  • PAINS (File Used)
  • PAINS Paper (Baell and Holloway)
  • Tox21 (File Used)
  • Tox21 (About)
  • RDKit Tools GitHub
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 5.2.1
  • Go to item
    KNIME Base Chemistry Types & NodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.0

    knime
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.1

    knime
  • Go to item
    KNIME JavaScript ViewsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.1

    knime
  • Go to item
    KNIME JavasnippetTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.0

    knime
  • Go to item
    KNIME Python IntegrationTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.1

    knime
  • Go to item
    KNIME Quick FormsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.1

    knime
  • Go to item
    KNIME SVG SupportTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.0

    knime
  • Go to item
    RDKit Nodes FeatureTrusted extension

    Novartis

    Version 4.9.1

    manuelschwarze

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits