In this video, we're going to talk about peptide mass fingerprinting. Peptide mass fingerprinting can be abbreviated as PMF, and it's a technique that we use to identify unknown proteins. It uses the mass spectrometry spectrums from very large computer databases to identify unknown proteins. If we have an unknown protein sample, we can take a little bit of this sample and subject it to tandem mass spectrometry to acquire an actual spectrum of the unknown protein. Then we compare this actual spectrum to the spectrums of all known proteins stored in computer databases until a match is found. Once a match has been made, we are able to identify the unknown protein. Typically, it's only necessary to analyze very small portions of the protein to identify it, which speeds up the process.
A limitation to peptide mass fingerprinting is that the database must already include the protein to identify it. If the database doesn't include the unknown protein, then a brand new protein that has never been discovered before is identified. We can fully sequence this unknown protein and enter its sequence into the database, so that later, if anyone extracts that same unknown protein, they can use peptide mass fingerprinting to quickly identify it.
Let's take a look at our example of peptide mass fingerprinting to clarify this concept better. We have two brackets here. The first bracket, highlighted in yellow, represents the entire tandem mass spectrometry process covered in previous lesson videos. The second bracket refers to peptide mass fingerprinting, the main focus of this video. We start with an already purified protein and subject it to fragmentation with a chemical or a protease, generating different protein fragments. These fragments are then ionized and subjected to the first mass spectrometry, which acts as a filter to select specific peptide fragments to continue forward in the process. Here, it's possible that the first mass spectrometry generates an actual spectrum, which is acceptable. With tandem mass spectrometry, only particular peptide fragments are selected to move forward. For example, we might subject a fragment to a collision cell to fragment it further, and then subject it to a second mass spectrometry (MS/MS) to generate a spectrum of this particular peptide fragment. We generate spectrums from multiple rounds of MS/MS, which are unique and specific enough to act like fingerprints. We then enter these MS/MS spectrums into a computer database, and the computer searches the database until it finds a match for these spectrums. The spectrums in the computer database are linked to protein sequences. Once a match is made, it's relatively simple to identify the subunits these fragments originated from and identify the original unknown protein.
This concludes our lesson on peptide mass fingerprinting, where we use tandem mass spectrometry to generate unique mass spectrums and enter them into a computer database until a match is found. In our next video, we will discuss other ways to sequence a protein. See you in our practice session.