Molecule
- class Molecule(atoms, bonds, position_matrix)[source]
Bases:
object
An abstract base class for molecules.
Notes
You might notice that some of the methods of this abstract base class are implemented. This is purely for convenience when implementing subclasses. The implemented public methods are simply default implementations, which can be safely ignored or overridden, when implementing subclasses. Any private methods are implementation details of these default implementations.
Examples
Aligning a Molecule with a Vector
You want to rotate a molecule, such that it is aligned along with a specific direction.
import stk import numpy as np molecule1 = stk.BuildingBlock('CCCCC') # Align molecule1 along x-axis. molecule1 = molecule1.with_rotation_between_vectors( start=molecule1.get_direction(), target=np.array([1., 0., 0.]), origin=molecule1.get_centroid(), ) molecule2 = stk.ConstructedMolecule( topology_graph=stk.polymer.Linear( building_blocks=( stk.BuildingBlock( smiles='BrCCCBr', functional_groups=(stk.BromoFactory(), ), ), ), repeating_unit='A', num_repeating_units=15, ), ) # Align molecule2 along the [1, 4, -3] vector. molecule2 = molecule2.with_rotation_between_vectors( start=molecule2.get_direction(), target=np.array([1., 4., -3.]), origin=molecule2.get_centroid(), )
Aligning a Molecule along a Plane
You want to place the benzene flat along the xy plane.
import stk import numpy as np benzene = stk.BuildingBlock('c1ccccc1') benzene = benzene.with_rotation_between_vectors( start=benzene.get_plane_normal(), target=np.array([0., 0., 1.]), origin=benzene.get_centroid(), )
Methods
clone
()Return a clone.
get_atomic_positions
([atom_ids])Yield the positions of atoms.
get_atoms
([atom_ids])Yield the atoms in the molecule, ordered by id.
Yield the bond in the molecule.
Map the id of each atom to its id under canonical ordering.
get_centroid
([atom_ids])Return the centroid.
get_direction
([atom_ids])Return a vector of best fit through the atoms.
get_maximum_diameter
([atom_ids])Return the maximum diameter.
Return the number of atoms in the molecule.
Return the number of bonds in the molecule.
get_plane_normal
([atom_ids])Return the normal to the plane of best fit.
Return a matrix holding the atomic positions.
Return an
rdkit
representation.Return a clone, with canonically ordered atoms.
with_centroid
(position[, atom_ids])Return a clone with its centroid at position.
with_displacement
(displacement)Return a displaced clone.
with_position_matrix
(position_matrix)Return a clone with atomic positions set by position_matrix.
with_rotation_about_axis
(angle, axis, origin)Return a rotated clone.
with_rotation_between_vectors
(start, target, ...)Return a rotated clone.
with_rotation_to_minimize_angle
(start, ...)Return a rotated clone.
with_structure_from_file
(path[, extension])Return a clone, with its structure taken from a file.
write
(path[, atom_ids])Write the structure to a file.
- get_atomic_positions(atom_ids=None)[source]
Yield the positions of atoms.
- get_centroid(atom_ids=None)[source]
Return the centroid.
- Parameters
atom_ids (
Union
[int
,Iterable
[int
],None
]) – The ids of atoms which are used to calculate the centroid. Can be a singleint
, if a single atom is to be used, orNone
if all atoms are to be used.- Return type
- Returns
The centroid of atoms specified by atom_ids.
- Raises
ValueError – If atom_ids has a length of
0
.
- get_direction(atom_ids=None)[source]
Return a vector of best fit through the atoms.
- Parameters
atom_ids (
Union
[int
,Iterable
[int
],None
]) – The ids of atoms which should be used to calculate the vector. Can be a singleint
, if a single atom is to be used, orNone
, if all atoms are to be used.- Return type
- Returns
The vector of best fit.
- Raises
ValueError – If atom_ids has a length of
0
.
- get_maximum_diameter(atom_ids=None)[source]
Return the maximum diameter.
This method does not account for the van der Waals radius of atoms.
- Parameters
atom_ids (
Union
[int
,Iterable
[int
],None
]) – The ids of atoms which are considered when looking for the maximum diameter. Can be a singleint
, if a single atom is to be used, orNone
, if all atoms are to be used.- Return type
- Returns
The maximum diameter in the molecule.
- Raises
ValueError – If atom_ids has a length of
0
.
- get_num_atoms()[source]
Return the number of atoms in the molecule.
- Return type
- Returns
The number of atoms in the molecule.
- get_num_bonds()[source]
Return the number of bonds in the molecule.
- Return type
- Returns
The number of bonds in the molecule.
- get_plane_normal(atom_ids=None)[source]
Return the normal to the plane of best fit.
- Parameters
atom_ids (
Union
[int
,Iterable
[int
],None
]) – The ids of atoms which should be used to calculate the plane. Can be a singleint
, if a single atom is to be used, orNone
, if all atoms are to be used.- Return type
- Returns
Vector orthonormal to the plane of the molecule.
- Raises
ValueError – If atom_ids has a length of
0
.
- get_position_matrix()[source]
Return a matrix holding the atomic positions.
- Return type
- Returns
The array has the shape
(n, 3)
. Each row holds the x, y and z coordinates of an atom.
- to_rdkit_mol()[source]
Return an
rdkit
representation.- Return type
Mol
- Returns
The molecule in
rdkit
format.
- with_canonical_atom_ordering()[source]
Return a clone, with canonically ordered atoms.
- Return type
- Returns
The clone.
- with_centroid(position, atom_ids=None)[source]
Return a clone with its centroid at position.
- Parameters
position (
ndarray
) – This array holds the position on which the centroid of the clone is going to be placed.atom_ids (
Union
[int
,Iterable
[int
],None
]) – The ids of atoms which should have their centroid set to position. Can be a singleint
, if a single atom is to be used, orNone
, if all atoms are to be used.
- Return type
- Returns
A clone with its centroid at position.
- with_position_matrix(position_matrix)[source]
Return a clone with atomic positions set by position_matrix.
- with_rotation_about_axis(angle, axis, origin)[source]
Return a rotated clone.
The clone is rotated by angle about axis on the origin.
- with_rotation_between_vectors(start, target, origin)[source]
Return a rotated clone.
The rotation is equal to a rotation from start to target.
Given two direction vectors, start and target, this method applies the rotation required transform start to target onto the clone. The rotation occurs about the origin.
For example, if the start and target vectors are 45 degrees apart, a 45 degree rotation will be applied to the clone. The rotation will be along the appropriate direction.
The great thing about this method is that you as long as you can associate a geometric feature of the molecule with a vector, then the clone can be rotated so that this vector is aligned with target. The defined vector can be virtually anything. This means that any geometric feature of the molecule can be easily aligned with any arbitrary direction.
- with_rotation_to_minimize_angle(start, target, axis, origin)[source]
Return a rotated clone.
The clone is rotated by the rotation required to minimize the angle between start and target.
Note that this function will not necessarily overlay the start and target vectors. This is because the possible rotation is restricted to the axis.
- Parameters
- Return type
- Returns
A rotated clone.
- Raises
ValueError – If target has a magnitude of 0. In this case it is not possible to calculate an angle between start and target.
- with_structure_from_file(path, extension=None)[source]
Return a clone, with its structure taken from a file.
Multiple file types are supported, namely:
.mol
,.sdf
- MDL V2000 and V3000 files.xyz
- XYZ files.mae
- Schrodinger Maestro files.coord
- Turbomole files.pdb
- PDB files
- Parameters
- Return type
- Returns
A clone with atomic positions found in path.
- write(path, atom_ids=None)[source]
Write the structure to a file.
This function will write the format based on the extension of path.
.mol
,.sdf
- MDL V3000 MOL file.xyz
- XYZ file.pdb
- PDB file
- Parameters
path (
str
) – The path to which the molecule should be written.atom_ids (
Union
[int
,Iterable
[int
],None
]) – The atom ids of atoms to write. Can be a singleint
, if a single atom is to be used, orNone
, if all atoms are to be used. If you use this parameter, the atom ids in the file may not correspond to the atom ids in the molecule.
- Return type
- Returns
The molecule.