Dalke Scientific Software: More science. Less time.

See Assignment #1 for instructions on how to submit this assignment. The short version is to send me a tar or zip archive of a directory named "assignment3" with your name and answers in the README file. You will also include an image file in your submission.

ASSIGNMENT #3

Question 1

Warfarin is a blood thinner which is also a rat poison.

Of those, use the compound with CID 6691. ("CID" means "Compound ID".)

Bring up CID: 942. It's a highly toxic alkaloid.

CID: 445354 is part of the rhodopsin in your eye. It's similar to a vitamin compound.

And one last one. Again, use PubChem for this; don't search the PDB directly.
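
If you end up looking at several CIDs in a row, it can be handy to generate the PubChem links rather than typing each one. The following is a minimal sketch and not part of the assignment: it assumes the current PubChem compound page layout and the PUG REST "property" URL scheme, and the helper names compound_page_url and compound_property_url are made up for illustration. It only builds URLs and does not contact PubChem.

```python
from urllib.parse import quote

# Assumed base URLs (taken from PubChem's public site, not from this assignment).
PUBCHEM = "https://pubchem.ncbi.nlm.nih.gov"
PUG_REST = PUBCHEM + "/rest/pug"


def compound_page_url(cid):
    """Return the URL of the PubChem summary page for a compound ID."""
    return "%s/compound/%d" % (PUBCHEM, int(cid))


def compound_property_url(cid, properties=("MolecularFormula", "IUPACName"), fmt="CSV"):
    """Return a PUG REST URL asking for the given properties of one CID."""
    props = ",".join(quote(p) for p in properties)
    return "%s/compound/cid/%d/property/%s/%s" % (PUG_REST, int(cid), props, fmt)


if __name__ == "__main__":
    # The three CIDs mentioned in this question.
    for cid in (6691, 942, 445354):
        print(compound_page_url(cid))
        print(compound_property_url(cid))
```

Paste the printed links into a browser. You still need to read the PubChem pages yourself to answer the questions.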

Question 2

Here's a set of depictions. Create the SMILES string for each one. You might use the OpenEye depict demo or the Daylight depict demo to test and compare your SMILES strings.

What names does the OpenEye namer generate for each structure?

If you would like additional exercises then try the Daylight SMILES practice page. These are optional and there is no need to send me any answers.

Question 3

These questions all use SMARTS. To test your answers use the OpenEye depict page. I found that using the "COB" (for "color on black") option instead of "BOW" (for "black on white") gives a more readable picture. You may want to toggle the "cp2txt" option which copies the input text into the results window.

Given the SMILES string c1ncccc1OC:

  1. How many times does the SMARTS pattern "C" match?
  2. How many times does the pattern "c" match?
  3. How many times does "[#6]" match?
  4. In English, describe what "[#6]" matches.
  5. What SMARTS pattern matches the aromatic nitrogen?
  6. How many times does "Occ" match?

If you would like additional exercises then try the Daylight SMARTS practice page. These are optional and there is no need to send me any answers.
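
In SMILES and SMARTS, an uppercase organic-subset symbol such as C or N is an aliphatic atom and a lowercase one such as c or n is an aromatic atom. The toy below makes that distinction visible on a different molecule (toluene) by counting symbols as written. It is only a sketch under loud assumptions: it is a regular-expression tokenizer, not a SMILES parser or a SMARTS matcher, it rejects bracket atoms, and it does no aromaticity perception, so a ring written in Kekulé form such as C1=CC=CC=C1 is counted as aliphatic even though a real toolkit would perceive it as aromatic. The function names are invented for this example.

```python
import re
from collections import Counter

# Unbracketed organic-subset atoms only: two-letter halogens first,
# then aliphatic (uppercase) and aromatic (lowercase) single letters.
ATOM = re.compile(r"Cl|Br|[BCNOPSFI]|[bcnops]")


def atom_symbols(smiles):
    """Return the atom symbols of a bracket-free SMILES, in written order."""
    if "[" in smiles:
        raise ValueError("bracket atoms are not handled by this sketch")
    return ATOM.findall(smiles)


def count_symbol(smiles, symbol):
    """Count atoms written exactly as `symbol` (case-sensitive)."""
    return Counter(atom_symbols(smiles))[symbol]


if __name__ == "__main__":
    toluene = "Cc1ccccc1"
    print(atom_symbols(toluene))
    print("aliphatic C:", count_symbol(toluene, "C"))
    print("aromatic c:", count_symbol(toluene, "c"))
```

Use the depict page for your actual answers. It works on the perceived structure, while this toy only looks at the characters you typed.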

Question 4

Here's a SMILES string CC(C)C=CCCCCC(=O)NCC1=CC(=C(C=C1)O)OC for a compound I like.



Copyright © 2001-2013 Andrew Dalke Scientific AB