Molecular dynamics simulation methods produce trajectories of atomic positions (and optionally velocities and energies) as a function of time and provide a representation of the sampling of a given molecule's energetically accessible conformational ensemble. As simulations on the 10-100 ns time scale become routine, with sampled configurations stored on the picosecond time scale, such trajectories contain large amounts of data. Data-mining techniques, like clustering, provide one means to group and make sense of the information in the trajectory. In this work, several clustering algorithms were implemented, compared, and utilized to understand MD trajectory data. The development of the algorithms into a freely available C code library, and their application to a simple test example of random (or systematically placed) points in a 2D plane (where the pairwise metric is the distance between points) provide a means to understand the relative performance. Eleven different clustering algorithms were developed, ranging from top-down splitting (hierarchical) and bottom-up aggregating (including single-linkage edge joining, centroid-linkage, average-linkage, complete-linkage, centripetal, and centripetal-complete) to various refinement (means, Bayesian, and self-organizing maps) and tree (COBWEB) algorithms. Systematic testing in the context of MD simulation of various DNA systems (including DNA single strands and the interaction of a minor groove binding drug DB226 with a DNA hairpin) allows a more direct assessment of the relative merits of the distinct clustering algorithms. Additionally, means to assess the relative performance and differences between the algorithms, to dynamically select the initial cluster count, and to achieve faster data mining by "sieved clustering" were evaluated. Overall, it was found that there is no one perfect "one size fits all" algorithm for clustering MD trajectories and that the results strongly depend on the choice of atoms for the pairwise comparison. Some algorithms tend to produce homogeneously sized clusters, whereas others have a tendency to produce singleton clusters. Issues related to the choice of a pairwise metric, clustering metrics, which atom selection is used for the comparison, and about the relative performance are discussed. Overall, the best performance was observed with the average-linkage, means, and SOM algorithms. If the cluster count is not known in advance, the hierarchical or average-linkage clustering algorithms are recommended. Although these algorithms perform well, it is important to be aware of the limitations or weaknesses of each algorithm, specifically the high sensitivity to outliers with hierarchical, the tendency to generate homogenously sized clusters with means, and the tendency to produce small or singleton clusters with average-linkage.
Nonlinear least squares fitting was used to assign rate constants for the three-barrier, two-site, double-occupancy, single-filing kinetic model for previously reported current-voltage relations of (5F-Indole)Trp(13) gramicidin A and gramicidin A channels (, 75:2830-2844). By judicious coupling of parameters, it was possible to reduce the parameter space from 64 parameters to 24, and a reasonable fit consistent with other experimental data was obtained. The main features of the fit were that fluorination increased the rate constant for translocation by a factor of 2.33, consistent with a free energy change in the translocation barrier of -0.50 kcal/mol, and increased first-ion binding affinity by a factor of 1.13, primarily by decreasing the first-ion exit rate constant. The translocation rate constant was 5.62 times slower in diphytanoyl phosphatidylcholine (DPhPC) bilayers than in monoolein (GMO) bilayers (coupled for the four combinations of peptide and salt), suggesting a 44.2-mV difference in the projection of the interfacial dipole into the channel. Thus fluorination caused increased currents in DPhPC bilayers, where a high interfacial dipole potential makes translocation more rate limiting because the translocation barrier was reduced, and decreased currents in GMO bilayers, where ion exit or entry is rate limiting because these barriers were increased.
Fluorination of peptide side chains has been shown to perturb gramicidin channel conductance without significantly changing the average side chain structure, which, it is hoped, will allow detailed analysis of electrostatic modulation of current flow. Here we report a 1312-point potassium current-voltage-concentration data set for homodimeric channels formed from gramicidin A (gA) or any of eight fluorinated Trp analogs in both lecithin and monoglyceride bilayers. We fit the data with a three-barrier, two-site, two-ion (3B2S) kinetic model. The fluorination-induced changes in the rate constants were constrained by the same factor in both lipids. The rate constant changes were converted to transition-state free-energy differences for comparison with previous electrostatic potential energy differences based on an ab initio force field. The model allowed a reasonably good fit (chi = 8.29 with 1271 degrees of freedom). The measured changes were subtle. Nevertheless, the fitted energy perturbations agree well with electrostatic predictions for five of the eight peptides. For the other three analogs, the fitted changes suggested a reduced translocation barrier rather than the reduced exit barrier as predicted by electrostatics.
Measurements are made of soot mass concentration in a luminous, liquid fuel spray, diffusion flame at atmospheric pressure. Intrusive sampling probes are used to study the effects of sampling rate, cooling, nitrogen-dilution ratio, and tip geometry on the mass of soot particles deposited on filters. Probe diameters have been kept small to minimize disturbance to the flow-field. Relative soot concentrations are observed to be lowest for uncooled probes, higher for water-cooled probes and still higher for probes with both water cooling and nitrogen injection. Furthermore, soot concentration steadily rises as the nitrogen/sample dilution ratio is increased from zero to as high as 1.5. Sampling rate has little effect on soot concentrations under most, but not all, sampling conditions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.