-
Notifications
You must be signed in to change notification settings - Fork 105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Segfault when iterating CDX file from USPTO downloads #1126
Comments
Please check it with Indigo 1.11.0. I wasn't able to reproduce the issue but I did investigation with valgrind and it indicates memory access issues during the loading of cdx from your example. These issues doesn't appear on 1.11.0. |
latest indigo pip package version and github release is 1.10.0 https://pypi.org/project/epam.indigo/ https://github.com/epam/Indigo/releases shall I then build it from 1.11.0 branch? Many thanks! |
It seems that 1.11.0-rc1 was published here: https://github.com/epam/Indigo/actions/runs/4731689107/jobs/8397946421 |
That's great, I didn't see it. I'll check that now, thanks! |
Please let me know if everything is ok. |
It still crashes for me. Some times it manages to parse the file and convert it to molfile but that might be 1 in 10. If I convert to smiles it seems to always crash now but this might be a completely random behaviour. from indigo import Indigo
indigo = Indigo()
for item in indigo.iterateCDXFile("US06174985-20010116-C00003.CDX"):
print(item.smiles())
|
I can run the code now without problems by using Python 3.11 ( I was using 3.10) Attached a Dockerfile with Python 3.10 that reproduces the issue. Container dies and stops dying if changing the Python version in the FROM statement. |
Sorry, it seems I'm still finding issues in Python 3.11 US06171768-20010109-C00056.zip from indigo import Indigo
indigo = Indigo()
for i in range (1, 1000):
for item in indigo.iterateCDXFile("US06171768-20010109-C00056.CDX"):
print(item.smiles())
|
I am working on the issue, and the fix will be available in Indigo 1.12.0-rc.1, which is scheduled for release in a week or two. |
Steps to Reproduce
Indigo 1.10.0 installed via pip install epam.indigo
Python 3.10.10
Ubuntu linux
Expected behavior
Normal iteration without segfault
Actual behavior
Segfault
Attachments
US06174985-20010116-C00003.zip
Indigo version
1.10.0
Additional context
The CDX file comes packed in the following USPTO download file: https://bulkdata.uspto.gov/data/patent/grant/redbook/2001/20010116.ZIP this seems to be happening with other CDX files.
The text was updated successfully, but these errors were encountered: