

Parse a remote xml.gz database file without downloading it
I need to parse the PubChem database to search the compound records for certain markers
(toxicity codes, to be exact; they look like 'H300') and then add the matching CIDs to the corresponding lists.
The database is here:
https://ftp.ncbi.nih.gov/pubchem/Compound/CURRENT-Full/XML/
But the xml.gz files there are so big that I can't unpack them on my computer.
Is there a way to read these files directly from the PubChem server?
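
For reference, a minimal sketch of the kind of streaming approach I have in mind, in Python. It does not avoid transferring the bytes, but it decompresses and parses the file on the fly, so nothing is saved to disk and only one compound record is held in memory at a time. The function names (scan_compounds, scan_remote) are hypothetical. The tag names PC-Compound and PC-CompoundType_id_cid are assumptions about the file layout and should be checked against a real file, and it is also an assumption that the 'H300'-style codes appear as plain text inside these records.

```python
import gzip
import re
import urllib.request
import xml.etree.ElementTree as ET
from collections import defaultdict

# Codes that look like 'H300': the letter H followed by exactly three digits.
HAZARD_CODE = re.compile(r"\bH\d{3}\b")


def local_name(tag):
    """Drop a leading '{namespace}' from an ElementTree tag, if present."""
    return tag.rsplit("}", 1)[-1]


def scan_compounds(raw_stream,
                   record_tag="PC-Compound",            # assumed record element
                   cid_tag="PC-CompoundType_id_cid"):   # assumed CID element
    """Map each code found to the list of CIDs whose record mentions it.

    raw_stream is any binary file-like object that yields gzip-compressed XML.
    """
    cids_by_code = defaultdict(list)
    with gzip.GzipFile(fileobj=raw_stream) as xml_stream:
        context = ET.iterparse(xml_stream, events=("start", "end"))
        _, root = next(context)  # first event is the start of the root element
        for event, elem in context:
            if event != "end" or local_name(elem.tag) != record_tag:
                continue
            cid = None
            codes = set()
            for node in elem.iter():
                text = node.text or ""
                if cid is None and local_name(node.tag) == cid_tag:
                    cid = text.strip()
                codes.update(HAZARD_CODE.findall(text))
            for code in codes:
                cids_by_code[code].append(cid)
            root.clear()  # discard finished records so memory use stays flat
    return dict(cids_by_code)


def scan_remote(url):
    """Stream one remote .xml.gz file through scan_compounds without saving it."""
    with urllib.request.urlopen(url) as response:
        return scan_compounds(response)
```

scan_remote would be called with the full URL of one of the .xml.gz files in the directory above. Each file is still read over the network from start to finish, so a full pass over the whole database would take a long time.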