Digital access to chemical journals resulted in a vast array of molecular information that is now available in the supplementary material files in PDF format. However, extracting this molecular information, generally from a PDF document format is a daunting task. Here we present an approach to harvest 3D molecular data from the supporting information of scientific research articles that are normally available from publisher's resources. In order to demonstrate the feasibility of extracting truly computable molecules from PDF file formats in a fast and efficient manner, we have developed a Java based application, namely ChemEngine. This program recognizes textual patterns from the supplementary data and generates standard molecular structure data (bond matrix, atomic coordinates) that can be subjected to a multitude of computational processes automatically. The methodology has been demonstrated via several case studies on different formats of coordinates data stored in supplementary information files, wherein ChemEngine selectively harvested the atomic coordinates and interpreted them as molecules with high accuracy. The reusability of extracted molecular coordinate data was demonstrated by computing Single Point Energies that were in close agreement with the original computed data provided with the articles. It is envisaged that the methodology will enable large scale conversion of molecular information from supplementary files available in the PDF format into a collection of ready- to- compute molecular data to create an automated workflow for advanced computational processes. Software along with source codes and instructions available at http://ift.tt/2islzYp abstract.
http://ift.tt/2ij5Ihy
Αρχειοθήκη ιστολογίου
-
►
2023
(256)
- ► Φεβρουαρίου (140)
- ► Ιανουαρίου (116)
-
►
2022
(1695)
- ► Δεκεμβρίου (78)
- ► Σεπτεμβρίου (142)
- ► Φεβρουαρίου (155)
-
►
2021
(5507)
- ► Δεκεμβρίου (139)
- ► Σεπτεμβρίου (333)
- ► Φεβρουαρίου (628)
-
►
2020
(1810)
- ► Δεκεμβρίου (544)
- ► Σεπτεμβρίου (32)
- ► Φεβρουαρίου (28)
-
►
2019
(7684)
- ► Δεκεμβρίου (18)
- ► Σεπτεμβρίου (53)
- ► Φεβρουαρίου (2841)
- ► Ιανουαρίου (2803)
-
►
2018
(31838)
- ► Δεκεμβρίου (2810)
- ► Σεπτεμβρίου (2870)
- ► Φεβρουαρίου (2420)
- ► Ιανουαρίου (2395)
-
►
2017
(31987)
- ► Δεκεμβρίου (2460)
- ► Σεπτεμβρίου (2605)
- ► Φεβρουαρίου (2785)
- ► Ιανουαρίου (2830)
-
▼
2016
(5308)
-
▼
Δεκεμβρίου
(2118)
-
▼
Δεκ 28
(256)
- Is routine audiometric testing necessary for child...
- Bilateral cochlear nerve absence in a 3 year old c...
- The effect of obesity, weight gain, and weight los...
- Microbes, allergic sensitization, and the natural ...
- AllergoOncology - The impact of Allergy in Oncolog...
- The Burden of Common Skin Diseases Assessed with t...
- Prevalence and socio-demographic correlates of phy...
- Outcome after protected full weightbearing treatme...
- Prevalence and socio-demographic correlates of phy...
- Outcome after protected full weightbearing treatme...
- MAM-6E7, MAM-3E7, MAM-6G7 (Antirecombinant Human T...
- Monoclonal Antibody Against Human GLRX3
- Preparation and Identification of Monoclonal Antib...
- Prevalence and socio-demographic correlates of phy...
- Prevalence and socio-demographic correlates of phy...
- Prevalence and socio-demographic correlates of phy...
- Prevalence and socio-demographic correlates of phy...
- Prevalence and socio-demographic correlates of phy...
- A patient preference study that evaluated fluticas...
- Change in nasal congestion index after treatment i...
- Prevalence of allergic sensitization to conifer po...
- Compressive optic neuropathy due to a large Onodi ...
- The effect of mupirocin- and fusidic acid‐nasal pa...
- Is a high-fiber diet able to influence ovalbumin-i...
- Skull base erosion and associated complications in...
- The role of simulation in teaching sinus surgery i...
- Association between vasomotor rhinitis and irritab...
- Concha bullosa mucocele: A case series and review ...
- ChemEngine: harvesting 3D chemical structures of s...
- Inelastic strain rate in the seismogenic layer of ...
- An adaptive random compressive partial sampling me...
- Degradation study of lindane by novel strains Kocu...
- ChemEngine: harvesting 3D chemical structures of s...
- Wage inequality, skill inequality, and employment:...
- Inelastic strain rate in the seismogenic layer of ...
- An adaptive random compressive partial sampling me...
- Degradation study of lindane by novel strains Kocu...
- The legacies of slavery in and out of Africa
- How Large are Earnings Penalties for Self-Employed...
- Wage rigidities and business cycle fluctuations: a...
- Gender unemployment gaps in the EU: blame the family
- Inelastic strain rate in the seismogenic layer of ...
- An adaptive random compressive partial sampling me...
- Degradation study of lindane by novel strains Kocu...
- Wage inequality, skill inequality, and employment:...
- The legacies of slavery in and out of Africa
- How Large are Earnings Penalties for Self-Employed...
- Wage rigidities and business cycle fluctuations: a...
- Gender unemployment gaps in the EU: blame the family
- Outcome after protected full weightbearing treatme...
- ChemEngine: harvesting 3D chemical structures of s...
- Outcome after protected full weightbearing treatme...
- ChemEngine: harvesting 3D chemical structures of s...
- Outcome after protected full weightbearing treatme...
- ChemEngine: harvesting 3D chemical structures of s...
- Outcome after protected full weightbearing treatme...
- ChemEngine: harvesting 3D chemical structures of s...
- Outcome after protected full weightbearing treatme...
- ChemEngine: harvesting 3D chemical structures of s...
- Improvement of speech perception in quiet and in n...
- TILs in Head and Neck Cancer: Ready for Clinical I...
- Sinonasal Renal Cell-Like Carcinoma: Case Report a...
- Biologic drug survival in Israeli psoriasis patients
- Periodontitis in oral pemphigus and pemphigoid: A ...
- Prospective studies on the routine use of a novel ...
- Diabetes Mellitus and the Skin
- Transitional cell carcinoma with extension of the ...
- Transitional cell carcinoma with extension of the ...
- Sinonasal Renal Cell-Like Carcinoma: Case Report a...
- TILs in Head and Neck Cancer: Ready for Clinical I...
- Pre-operative Assessment of Anatomical Position of...
- Transitional cell carcinoma with extension of the ...
- The Effects of Earphone Use and Environmental Lead...
- Organ of Corti and Stria Vascularis: Is there an I...
- The effect of obesity, weight gain, and weight los...
- Microbes, allergic sensitization, and the natural ...
- Transitional cell carcinoma with extension of the ...
- Complementary and Alternative Medicine for Atopic ...
- Postsurgical Infection: Administrative vs Registry...
- Gene Activity Predicts Progression of Systemic Scl...
- Why Do You Practice Medicine?
- Transitional cell carcinoma with extension of the ...
- Transitional cell carcinoma with extension of the ...
- Aflatoxin M 1 in human breast milk in southeastern...
- Transitional cell carcinoma with extension of the ...
- Does dental trauma in the primary dentition increa...
- Evaluation of ultrasonic and conventional surgical...
- IL-22 promotes Fas expression in oligodendrocytes ...
- Rituximab Treatment for Recalcitrant Dermatitis He...
- Ex Vivo Dermoscopy With Derm Dotting
- Demodectic Frost of the Ear
- Rethinking How We Select Dermatology Applicants
- Regional and State Differences in Melanoma Rates i...
- Indoor Tanning and Skin Cancer Risk Among Diverse ...
- Isolated Congenital Midline Upper Lip Sinus in A 5...
- Isolated Congenital Midline Upper Lip Sinus in A 5...
- Analysis of molecular networks and targets mining ...
- Analysis of molecular networks and targets mining ...
- News and Announcements
- Analysis of molecular networks and targets mining ...
-
▼
Δεκ 28
(256)
- ► Σεπτεμβρίου (877)
- ► Φεβρουαρίου (41)
- ► Ιανουαρίου (39)
-
▼
Δεκεμβρίου
(2118)
Αλέξανδρος Γ. Σφακιανάκης
ΩτοΡινοΛαρυγγολόγος
Αναπαύσεως 5
Άγιος Νικόλαος Κρήτη 72100
2841026182
6032607174
Τετάρτη 28 Δεκεμβρίου 2016
ChemEngine: harvesting 3D chemical structures of supplementary data from PDF files
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου