ReactionCode: Format for Reaction Searching, Analysis, Classification, Transform, and Encoding/Decoding
In the past two decades a lot of different formats for molecules and reactions have been created. These formats were mostly developed for the purposes of identifiers, representation, classification, analysis and data exchange. A lot of efforts ha ve been made on molecule formats but only few for reactions where the endeavors have been made mostly by companies leading to proprietary formats. Here, we developed a new open-source format which allows to encode and decode a reaction into multi-layers machine readable code, which aggregates reactants and products into a condensed graph of reaction (CGR). This format is flexible and can be used in a context of reaction similarity searching and classification. It is also designed for database organization, machine learning applications and as a new transform reaction language.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Due to technical limitations, full-text HTML conversion of this manuscript could not be completed. However, the manuscript can be downloaded and accessed as a PDF.
This is a list of supplementary files associated with this preprint. Click to download.
Posted 28 Sep, 2020
On 03 Dec, 2020
Received 16 Oct, 2020
On 16 Oct, 2020
On 05 Oct, 2020
Received 02 Oct, 2020
On 26 Sep, 2020
Invitations sent on 25 Sep, 2020
On 24 Sep, 2020
On 23 Sep, 2020
On 23 Sep, 2020
On 21 Sep, 2020
ReactionCode: Format for Reaction Searching, Analysis, Classification, Transform, and Encoding/Decoding
Posted 28 Sep, 2020
On 03 Dec, 2020
Received 16 Oct, 2020
On 16 Oct, 2020
On 05 Oct, 2020
Received 02 Oct, 2020
On 26 Sep, 2020
Invitations sent on 25 Sep, 2020
On 24 Sep, 2020
On 23 Sep, 2020
On 23 Sep, 2020
On 21 Sep, 2020
In the past two decades a lot of different formats for molecules and reactions have been created. These formats were mostly developed for the purposes of identifiers, representation, classification, analysis and data exchange. A lot of efforts ha ve been made on molecule formats but only few for reactions where the endeavors have been made mostly by companies leading to proprietary formats. Here, we developed a new open-source format which allows to encode and decode a reaction into multi-layers machine readable code, which aggregates reactants and products into a condensed graph of reaction (CGR). This format is flexible and can be used in a context of reaction similarity searching and classification. It is also designed for database organization, machine learning applications and as a new transform reaction language.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Due to technical limitations, full-text HTML conversion of this manuscript could not be completed. However, the manuscript can be downloaded and accessed as a PDF.