Skip to content

Does SMI-TED model support SMILES with stereochemical information? #59

@dongfeicui

Description

@dongfeicui

Hi team,

I noticed that in the function normalize_smiles (at load.py#L41 ), the parameter isomericis set to False by default:

def normalize_smiles(smi, canonical=True, isomeric=False):
    try:
        normalized = Chem.MolToSmiles(
            Chem.MolFromSmiles(smi),
            canonical=canonical,
            isomericSmiles=isomeric
        )
    except:
        normalized = None
    return normalized

As far as I understand, this means stereochemical information in SMILES (such as @, /, or ) will be lost during normalization.

I'd like to confirm:

  1. Is the current SMI-TED model designed not to support SMILES with stereochemical information?

  2. If that's the case, is there any plan or recommended way to handle isomeric SMILES (eg, enabling isomeric=Trueduring preprocessing)?

Thank you for your time and clarification!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions