首页 >
> 详细

实现分子计算器，练习Java三方类库的使用。

This is the last practical exercise and will continue over the remaining weeks of the course.

In this practical you will implement a real molecular similarity method

Ultrafast shape recognition to search compound databases for similar molecular shapes

So this problem involves reading from a file one reference molecule calculating a descriptor for it, then reading a series of molecules from a second file, computing the descriptor for each molecule and then quantifying the difference between it and the reference. At the end of the run the program should report the closest molecule and the magnitude of its difference to the reference. All files will be in SD format and hydrogens should be completely ignored in the procedure

The descriptor we will calculate consists of 4 triples of numbers. Each triple consists of 3 statistical measures of distances from a point.

The measures are

- The mean distance from the point (sum of all distances divided by number of distances)
- The variance of this distance (sum of the squares of distances - mean all divided by number of distances minus 1)
- The skew of this distance (sum of the cubes of (distances - mean) / standard dev all divided by number of distances. The standard deviation is the square root of the variance.

The four points we use to calculate these from are

- The centre of gravity
- the closest atom position to the COG
- The furthest atom position from the COG
- The furthest atom position from point 3 above.

To calculate the difference between any 12 double set and another simply do the equivalent of a distance calculation but over all 12 numbers.

Remember we know how to read SDfiles from a previous practical, however here is a reminder

In order to access the CDK library you will need some import statements

1 |
import org.openscience.cdk.CDKConstants; |

import org.openscience.cdk.interfaces.*;

To read a single SD file you could use something like

1 |
IteratingMDLReader MDLReader = new IteratingMDLReader(new FileInputStream(RefFile), DefaultChemObjectBuilder.getInstance()); |

To read a sequence of files from an SD file

1 |
MDLReader = new IteratingMDLReader(new FileInputStream(ScrFile), DefaultChemObjectBuilder.getInstance()); |

To get the name of a Molecule (here called m1) object

1 |
Name = new String(String.valueOf(m1.getProperty(CDKConstants.TITLE))); |

To get its number of atoms

1 |
int natoms = m1.getAtomCount(); |

you can get each atom in a molecule by

1 |
IAtom myatom = m1.getAtom(i); |

Where i is the ith atom

You can get the chemical symbol from each atom

1 |
String s1 = myatom.getSymbol(); |

You can get the coordinates as a Point3d object by

1 |
Point3d mypoint = myatom.getPoint3d(); |

(to use Point3d class you have to import `javax.vecmath.Point3d`

)

The Point3d class has a method called distance which returns the distance between the instance calling and its argument so

1 |
Point3d a,b; |

In addition to the usual criteria of Functionality, readability, comments and a readme file, I request that you prepare a document called plan.txt in which you write a simple logic plan for the program.

In order that you don’t get bogged down in the statistics I have given you a set of example methods to calculate mean, variance and skew.

联系我们

- QQ：99515681
- 邮箱：99515681@qq.com
- 工作时间：8:00-23:00
- 微信：codehelp

- Stat7017 Final Project 2020-03-29
- Cs3214 Spring 2020 Project 1 - “Extens 2020-03-29
- Co3090/Co7090 Distributed Systems And ... 2020-03-29
- Hw2: Sql 2020-03-29
- Hw1: 5 Points Entity-Relational (Er) 2020-03-29
- Math 104A Homework #3 2020-03-29
- Comp 250 Assignment 2 2020-03-29
- Cs 570课程作业代写、Program作业代做、C++语言作业代写、代做j 2020-03-29
- Comp-424作业代做、代写intelligence作业、Python，C 2020-03-29
- Database作业代做、代写cap Theorem作业、代写java程序语 2020-03-29
- 代做structure作业、代写python，Java,C++编程语言作业、 2020-03-29
- 代写sta238留学生作业、代做python，C++程序语言作业、Java编 2020-03-29
- Csc148留学生作业代做、代写computer Science作业、Pyt 2020-03-29
- Cmpt 365作业代做、代写programming作业、代做java，C+ 2020-03-29
- Fc712留学生作业代做、代写programming课程作业、代写pytho 2020-03-28
- Algorithms作业代写、代做dataset课程作业、C++，Pytho 2020-03-28
- 代做data留学生作业、代写r编程设计作业、代做r语言作业、代写progra 2020-03-28
- Csci3130作业代写、代做uml留学生作业、Python，C++，Jav 2020-03-28
- Eece5644作业代做、Matlab语言作业代做、代写matlab程序设计 2020-03-28
- 代写comp9321作业、代做python编程设计作业、代写python语言 2020-03-28