RELEVANT READING
Researchers at all career levels need to engage in substantial self-study. Formal classes/workshops on topics are wonderful ways to learn (likely preferred, in my opinion), but these options are not always available or viable on a timescale that meets your needs. I frequently get asked “how much time should I be reading/studying compared to <insert alternative activity here>?” You should not view dedicated reading/study as a waste of time if it is something that you really need to know. In fact, a good grasp of fundamental knowledge is likely to save you significant time in the long run, since you will have a better grasp on what you’re doing. What I would advise is to not let the study be aimless. Understand what you are hoping to learn and target your study around that. If you need to pick up some other background to make it happen, then do it. If you are unsure about whether learning something would be “useful,” then feel free to ask, and I may be able to help.
In the following, I outline some useful resources on various topics. We should have copies of most of the books listed, while others can be accessed from the library. Books for which we have copies will be denoted with a carat (^)
Molecular Simulation
There are two “classic” books that are typically owned by anyone who is serious about molecular simulation:
^Understanding Molecular Simulation by Daan Frenkel and Berend Smit (FS) (Recommended coverage: Ch. 1-7, 12, Appendix D, F, L; other concepts as needed)
^Computer Simulation of Liquids by Michael Allen and Domonic Tildesley (AT) (Recommended coverage: most of the book, but essentials are Ch. 1-5, 12, 13; other concepts as needed)
I recommend actually reading these books – especially the first few chapters; read them and then read them again and again. Then, revisit them. If you have a very foundation in the fundamentals of molecular simulation, you are well-equipped to undertake a vast range of computational research problems; you will also be able to understand and diagnose errors more rapidly. AT is a little more verbose than FS, which can be simultaneously beneficial and detrimental. Here is a standard to which you can apsire: prior to leaving my group, you should ideally be able to scan the Table of Contents for either book and know at least something about everything lists.
I also highly recommend The Living Journal of Computational Molecular Science, which has emerged as a very nice pedagogical and practical resource for newbies and veterans of molecular simulation. The best practices articles provide information that may be evident based on reading of FS or AT but are packaged in compact manner with allusion to other great resources. At a minimum, new members should read all of the following articles:
Other general resources:
^Molecular Modelling Methods: Principles and Applications by Andrew Leach – another reasonable book, but I prefer FS or AT for most graduate students. This book does have better/more substantial description of computational quantum mechanics.
Lecture notes/materials from Prof. M. Scott Shell’s Course on Modern Molecular Simulation Methods
^My lecture notes/videos for CBE422: Molecular Modeling Methods
^A Guide to Monte Carlo Simulations in Statistical Mechanics by Landau and Binder - FS and AT cover Monte Carlo methods to an extent that is sufficient for most people in the group, but this is a more comprehensive resource that may be useful for certain projects.
Enhanced Sampling and Free Energy Methods:
Many of the molecular simulation references will provide some reasonable introduction/explanation of free energy and enhanced sampling methods. However, I would also suggest reading this paper on a comprehensive software suite (SSAGES) that implements various enhanced sampling techniques for some rapid exposure and connection to other references. It is good to study the original papers for methods that you plan to use.
Coarse-graining Methods:
Students in the group may be anywhere from well versed to experts on coarse-graining methodology. Even if not able to implement or execute the methods yourself, one should understand the basic mechanics of major CG approaches. If you are going to be using a CG method, it is important that you thoroughly read the keystone papers on the subject. To get your bearings in the field of coarse-graining and multiscale simulation methods, I can suggest reading our review article on Chemically specific coarse-graining of polymers: Methods and prospects. This should give you a good initial presentation such that you can more confidently engage in older and newer literature. Another nice educational resource would be the VOTCA documentation.
Statistical Mechanics
If you can take a course dedicate toward statistical thermodynamics in your first year, then I think you should do it. The graduate CBE thermo class should provide some exposure, but realistically, not as much as I would like. The basis of a large number of calculations/analysis from our simulations derive from the framework of statistical thermodynamics – you must understand that connection. Therefore, you need a firm grasp of how different simulation algorithms relate to concepts in statistical mechanics. Here are some good potential references:
^Statistical Mechanics: A Concise Introduction for Chemists by Widom - good, brief introduction into stat mech. This is an appropriate place to start if someone received relatively little exposure during undergraduate preparation. <180 pages in total.
^Thermodynamics and Statistical Mechanics by Shell - gentle and integrative exposure to stat mech along with classical thermodynamics. This is the book that has been used in grad thermo.
^Statistcal Mechanics: Theory and Molecular Simulation by Tuckerman - a terribic book that presents theoretical underpinnings of a variety of common simulation methods.
^Statistical Mechanics by McQuarrie - probably my go-to reference. It can be a bit of a slog, but the essential elements that we consistently rely on are mostly clear.
^Introduction to Modern Statistical Mechanics by Chandler - another good book. It is often characterized a book that is best for people who already know stat mech.
An Introduction to Statistical Thermodynamics by Hill - comprehensive Dover (cheap!) book
Theory of Simple Liquids: with Applications to Soft Matter by Hansen and McDonald - comprehensive coverage classical liquids with specific discussion on simulation methods
^Molecular Driving Forces by Dill and Bromberg - expansive but digestible text, nice for some selected topics
Nonequilibrium Statistical Mechanics by Zwanzig - more detailed introduction and coverage on topics like fluctuation-dissipation, linear response, projection operators etc., which may be introduced in more limited or cursory fashion in prior texts. Nice appendices.
Polymer Physics/Soft Matter
Because of the problems that we typically target, it is generally beneficial to have some basic understanding of polymer physics. Depending on what you are doing, it may be critical. Importantly, we need to share a common vernacular. I personally would not characterize myself as a polymer physicist; I am a guy that knows some polymer physics. As with stat mech, and if warranted by your project, taking a dedicated course can be worth the time. In either case, the following are recommended:
^Polymer Physics by Rubinstein and Colby - start here. This is easy-to-read and provides most exposure that you need
^The Theory of Polymer Dynamics by Doi and Edwards
Statistical Mechanics of Chain Molecules by Flory
The Structure and Rehology of Complex Fluids by Larson
The CROW polymerdatabase has nice pages on a range of topics (e.g., the rotationa isomeric state model that I bring up at least once during a Ph.D)
Machine Learning/Data Science
Increasingly we are employing machine learning as either an integral or complementary tool in our work. Some folks are definitely staying “out of the game,” but to be fluent in group culture, you’ll need to know some basics. You can probably figure out what you need to know with a bit of basic reading and then paying mindful attention to others and asking questions. A lot of times the algorithms can be fancy and complicated, but the concepts are simple.
If machine learning is going to be a major component of your research, then formal and targeted introduction through coursework in your early years is encouraged. That being said, self-study can also be very effective because there are many free educational resources on the topic. To start, I recommend reading any reasonable introduction book. This will give you a basis to comprehend essential vernacular and algorithms that enable to you learn about more advanced topics.
Here are some useful resources:
^Deep Learning with Python by Chollet – Book from the creator of Keras. Simple, practical, and quick read.
Python Machine Learning by Rashka and Mirjalili – I think an earlier version of this book is the first one that I read.
^Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control – This is a newer book that provides a pretty good exposure to some topics, but the level of depth varies by topic. It is complemented by their freely available online lectures and github code, which is nice.
Deep Learning for Molecules and Materials by Andrew White – pretty much the best first game in town for introduction to machine learning in the context of chemistry and materials science. New topics are added periodically to maintain technical relevance. Definitely consult this book.
*Mathematics for Machine Learning by Deisenroth, Faisal, and Ong - very nice coverage/review of important mathematical concepts.
Deep Learning by Goodfellow, Bengio, and Courville
*Gaussian Processes for Machine Learning by Rasmussen and Williams - surprisingly digestible and understandable theory of Gaussian Processes
Then, I also highly recommend checking out these articles:
“Taking the Human Out of the Loop: A Review of Bayesian Optimization” by Swersky et al. - very useful article discussing Bayesian optimization, acquisition functions and the like, which are important for a lot of our applications
“Best practices in machine learning for chemistry” by Artrith et al. - title says it
“Machine Learning for Materials Scientists: An Introductory Guide toward Best Practices” by Wang et al. - title says it
“Physics-Inspired Structural Representations for Molecules and Materials” by Musil et al. - title says it
Finally, you should not underrate the plethora of excellent articles and tutorials put together by true heroes on
Quantum Chemistry
Disclosure: I am not an expert in quantum chemistry (QC) or electronic structure theory. Nonetheless, elements of quantum chemistry do enter into our research. For one thing, it is essential that you understand the relationships amongst the hierarchies of methods that we encounter. If you do not know the term “Born-Oppenheimer Potential Energy Surface’’ and how that relates to a classical force field, then you need to brush up on some quantum. In addition, we may pursue some QC calculations from time-to-time. The most common instances are when parameterizing force fields or using QC for machine learning tasks. Some familiarity will also be necessary to understand and interaction with relelvant literature. The following books may be of interest:
^Molecular Modelling Methods: Principles and Applications by Andrew Leach – this was listed earlier already with the allusion to its QC content.
^Introduction to Computational Physical Chemistry by Joshua Schrier – lovely pedagogical introduction to computational chemistry overall, but I think the quantum parts are particularly well done (especially Huckel theory)
^Modern Quantum Chemistry: Introduction to Advanced Electronic Structure Theory by Szabo and Ostlund - another cheap Dover book. I believe it is a staple for electronic strucure theorists.
For more basic exposure, you could also consult an undergraduate QC textbook. I thought Introduction to Quantum Mechanics by Griffiths was particularly clear and ^Quantum Chemistry by McQuarrie is solid.