Hi,<br>I&#39;m using Gromacs 4.5.5 to simulate a protein in explicit solvent. <br>The simulation is running fine and I have collected 9 100ns simulations. I&#39;m eliminating the first 50 ns for equilibration and sheer data size.<br>

What I want to do now is to cluster the simulations based on rmsd of certain residues. I have used make_ndx to select the residues and made a group.<br>I have used trjcat -cat option to concatenate the 9 runs into a single giant .xtc file. In the .xtc file, I only output the protein. <br>

My plan was to use g_rms to create a rmsd.xpm table and using that with g_cluster. However, when I do run g_rms, it seems like it will take weeks to finish. <br>The RMSD matrix size becomes 225000x225000 and my question is: How do you cluster this much data? Any recommendation would be useful.<br>

<br>Thank you for your time.<br>