pocket representations have different dimensions for pair_repr #234
Comments
Is it this pocket repr demo?
Hello @ZhouGengmo, thanks a lot for the reply. Everything is fine when I run the unimol_pocket_repr_demo notebook with the data you provided. However, when I run the same code on my own input, I run into the dimension problem. I suspect it is due to differing pocket sizes, but your inputs also have different pocket sizes, so there might be some pre-processing step that I missed? My input PDBs are 4jym and 5dj5 with their bound ligands removed. The pocket json is: The output dimensions I got for 'pair_repr' are: (111, 111, 64) and (172, 172, 64)
It is normal for the dimensions to differ. The dimensions of the representation are related to the pocket size. This is also reflected in the example data CASF2016, where not all data have the same dimensions. For instance:
It is also recommended to use unimol_tools, which is more user-friendly.
Thank you @ZhouGengmo. How would you recommend treating these representations of different dimensions for comparison?
I recommend using the CLS representation to represent the entire pocket. The CLS representations of different pockets have the same dimensions, i.e.,
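As an illustration of comparing pockets through their fixed-size CLS vectors, here is a minimal sketch that scores two such vectors with cosine similarity. Cosine similarity is one common choice and is not something the maintainers prescribe in this thread. The variable names and the short toy vectors are hypothetical stand-ins for the real CLS outputs, which would be much longer.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical toy values standing in for the CLS vectors of two pockets.
# Real CLS vectors share one fixed length regardless of pocket size.
cls_4jym = [0.2, -0.1, 0.7, 0.4]
cls_5dj5 = [0.1, -0.3, 0.6, 0.5]

similarity = cosine_similarity(cls_4jym, cls_5dj5)
print(f"cosine similarity: {similarity:.3f}")
```

Because every pocket yields a CLS vector of the same length, this comparison works even when the 'pair_repr' arrays have different shapes such as (111, 111, 64) and (172, 172, 64).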
Hi @ZhouGengmo, there doesn't seem to be a pocket representation implementation in unimol_tools yet? In the meantime, if I continue to use the notebook implementation mentioned above, is 'mol_repr', which gives a (512,)-dimensional output for all pockets, the same as "mol_repr_cls"? If so, does that mean the figure below should be annotated with "molecular representation" (mol_repr or mol_repr_cls) instead of "atom representation" for pockets? If not, could you please elaborate on the different representation outputs for pockets?
Yes, in this demo,
This figure is OK. CLS is a special token added before all atoms and is used to represent the whole molecule/pocket. The atom-level representation of Uni-Mol is not included in this demo. Do you want to use the atom representation? I will add it to this demo ASAP.
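To make the CLS-versus-atom distinction concrete, here is a minimal sketch that splits a per-token embedding table into its first row and the remaining rows. It assumes the CLS row sits at index 0, following the description of CLS as a token added before all atoms. The function name and toy values are hypothetical, and whether the remaining rows also contain other special tokens is not stated in this thread.

```python
def split_cls_and_rest(token_repr):
    """Split per-token embeddings into (first row, remaining rows).

    Assumes row 0 is the CLS token. The remaining rows are returned
    untouched and may include any other special tokens the model adds.
    """
    if not token_repr:
        raise ValueError("expected at least the CLS row")
    return token_repr[0], token_repr[1:]


# Hypothetical toy table: 1 assumed CLS row followed by 2 further rows,
# each with 3 features (real embeddings are far wider).
tokens = [
    [0.9, 0.1, 0.0],  # assumed CLS row
    [0.2, 0.3, 0.4],
    [0.5, 0.6, 0.7],
]

cls_vec, other_rows = split_cls_and_rest(tokens)
print(len(cls_vec), len(other_rows))
```

The first row has a fixed length for every pocket, while the number of remaining rows grows with pocket size, which is why only the CLS vector is directly comparable across pockets.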
Adding the atom representation would be great, thank you!
Added in PR #247. You can pull the latest code.
Hello,
I tried to generate pocket representations for my own dataset using the code provided in the demo notebook. I got pair representations with different dimensions for different molecules, e.g. [n, n, 64], where n differs from molecule to molecule.
On the other hand, when I rerun the demo case, I get the same dimensions for all pair representations.
Could you please help point out which steps I might have missed?
Thanks!