Detailed description |
|
Revision: 699
Description:
featurize gives warnings that it can't find PDB files when using RCSB "divided" data stores. RCSB "divided" data stores uses a directory called "divided" and a subdirectory named for the middle two letters of the PDB ID. For example:
Given PDB_DIR=/usr/local/feature/data/pdb
1A2L would be found as
/usr/local/feature/data/pdb/divided/A2/1A2L.pdb.gz
Somehow featurize calculates the subdirectory to be "A2a2" and fails to find the file.
How to Repeat:
1. Download PDBs using the RCSB "divided" data store structure.
2. Checkout and build r699 on Linux.
3. Run featurize on a pointfile.
Repeatability: 100%
Workaround:
Don't use the RCSB "divided" data store and instead use a flat file system. Current SeqFEATURE models and Thioredoxin use about 5,000 or so PDB files. All targeted systems (Mac OS X, Desktop Linux, and Cluster Linux) can handle this many files in a directory. |
|