In this video, we're going to do an overview of direct protein sequencing. This video is quite unique because it is just an overview. We're going to mention a bunch of different techniques that we have not yet talked about, but we will cover each of those techniques in more detail as we move forward in our course. In this video, I don't want you to get caught up in any one particular step or technique because I want you to focus on the overall big picture of direct protein sequencing. A lot of our confusion on these individual techniques will be cleared up as we move forward in our course. With that being said, let's get started.
First, I want you to recall that the primary level of protein structure is really important to biochemists because it dictates all the other levels of protein structure, as well as the functions of that protein. The primary level of protein structure includes both the amino acid composition or the number and types of amino acids, as well as the amino acid sequence. The amino acid sequence is also known as just the protein sequence, which is the particular order of amino acids from the N-terminal end of the protein to the C-terminal end of the protein. Biochemists can use the sequence of a protein to obtain more information about that protein because they're able to use the sequence of the protein to predict both the overall three-dimensional structure of that protein, as well as the functions of that protein. It's really important for biochemists to have methods, different methods, for obtaining the protein sequence of an unknown protein. That's really what this video is all about.
It's important to keep in mind that most proteins in nature are quite large. They can vary from having several hundreds of amino acids to several thousand amino acids. It's really important that we cleave and fragment our large proteins down into smaller peptide fragments before sequencing them. The reason for why we need to cleave down our large proteins into fragments before sequencing them will be discussed later in our course in another video. For now, just know that in our example down below, we're going to be mentioning different techniques that are used to cleave and fragment our protein down into smaller pieces. Keep that in mind.
In our example below, what you'll notice is that we have a diagram, which you can use as a map that will guide you through the next set of lesson videos. The beginning of our map is over here, on the far left, and essentially what we're going to do is follow the arrows to the end of our map. This first portion of our map over here is really just a review. We know that when we're interested in sequencing a protein, we're going to need a source of that protein, typically cells. So here, we're performing cell lysis so that we can perform protein extraction and obtain all of our proteins, especially the proteins of interest.
Here we have a protein mixture, where you can see we have a bunch of different types of proteins. We've got these green proteins, black proteins, blue proteins, and gray proteins, and they're all different colors and shapes. You can see we've got this orange protein right here, which is the particular protein of interest in this example. When we have a protein mixture, we know that we can perform a set of different protein purification techniques, such as all of the different types of chromatography that we already covered in our previous lesson videos, such as ion exchange chromatography, size exclusion, affinity chromatography, HPLC. Upon protein purification, we're able to take our particular protein of interest and isolate it. This is really where we're starting on our map. Nothing we've talked about from this point through this point is new information; it's all a review from our previous videos.
Once we have our isolated purified protein, we have two different paths we can take. We can take this path going up leading to a sequenced protein, or we can take the path to the right that also leads to a sequenced protein at the very bottom, shown here. For those of you that have already seen our videos on tandem mass spectrometry and peptide mass fingerprinting, you know that this upper path is a gold standard for sequencing proteins. If you haven't seen those videos on tandem mass spectrometry and peptide mass fingerprinting yet, we're going to talk about them very shortly. Don't be too worried. This path up here is the gold standard path, and it's a short path to get us the result we want, which is a sequenced orange protein here, the protein of interest. But what happens if your lab does not have access to a mass spectrometer or a tandem mass spectrometer? How do you perform this technique up here without a tandem mass spectrometer? You can't. We need a different technique to sequence our protein. That's what this right path, leading down into our diagram, is all about.
We can take our sample, our isolated purified protein, and we can split it into as many samples as we want. In this particular example, we're splitting our isolated purified protein into just three different samples: sample number 1, sample number 2, and sample number 3. Going down following the arrow, what you'll notice is that we first encounter the path for sample number 3, then sample number 2, and lastly sample number 1. You'll also notice that for sample number 3 and sample number 2 paths, they have dotted lines, which are essentially just to say that technically we could go directly to samples 3 and 2, but it's not really the best and most logical path. The best path is actually following the solid start with sample number 1, and that's exactly what we're going to do here. We're going to skip these dotted lines here. We'll come back to them later, but we're going to start with sample number 1.
With sample number 1, what we can do is take a little bit of our isolated purified protein here and put it in sample number 1. Then we can treat our isolated protein with a chemical known as FDNB. Don't get too caught up in exactly what it does, but briefly, FDNB is used for determining the N-terminal amino acid residue of the protein. That can be very useful for biochemists, and we'll talk about why in our later videos. After we treat our isolated purified protein sample with FDNB, we can subject it to our first cleavage technique. Remember, our large protein of interest here, most of our proteins, are going to be very large. Even though our protein is being shown to have only 8 amino acid residues, shown by these 8 orange dots, we can imagine that this protein is actually really large with several hundred to a thousand of amino acids.
Our first cleavage technique is called amino acid hydrolysis. We're going to talk about this in more detail later in our course. Essentially, it non-specifically cleaves all of the peptide bonds in our protein. All these dotted black lines that we see vertically going up and down are just referring to the cleavage of those peptide bonds. It results in free amino acids or amino acids that are not linked via covalent peptide bonds. After amino acid hydrolysis, we can subject these free amino acids to a technique such as HPLC. We can use HPLC to determine the amino acid composition. Recall that the amino acid composition only refers to the numbers or the abundance of amino acids, as well as the types of amino acids that are present. However, the composition does not tell us the order of amino acids or the sequence of amino acids.
Essentially, sample number 1 does not get us to where we need to be, which is obtaining the sequence of our proteins. But because it gives us the composition of our protein, we can use this composition to strategically select the appropriate reagents that we're going to use in sample number 2, as well as in sample number 3. That's why it's important to use sample number 1 here first in amino acid hydrolysis before we get to samples 2 and 3. Notice that the next step is to either follow this arrow through to step number 2 or follow the arrow and go to step number 3. It doesn't really matter which path we're going to take. First, we'll follow step 2, and then we'll move on to step 3.
In sample number 2, we can take another little bit of our isolated protein sample and move it down into our second step down below. We can then subject this isolated protein to a chemical reagent to perform chemical cleavage. Chemical cleavage essentially involves using very specific chemicals to cleave very specific peptide bonds. We're going to talk more about chemical cleavage as we move forward later in our course. There are a lot of different chemicals that we can use to perform chemical cleavage. Here, we're performing chemical cleavage that cleaves this one particular peptide bond. Not that it always cleaves one particular peptide bond, but chemical cleavage will only cleave very specific peptide bonds. This one position here generates two different fragments, a 3-residue fragment and a 5-residue fragment. We can see those down below, the 3-residue and the 5-residue peptide fragments.
All these arrows here are merging down to this one arrow down here. Before we continue forward, let's move over to sample number 3. In sample 3, we can take our last bit of our isolated purified protein sample and bring it down to this sample here. In this sample, we can subject our purified protein to a peptidase. Peptidases are enzymes, and they cleave very specific peptide bonds. In this particular example, the peptidase that's being used is cleaving at two different positions. It's cleaving at this peptide bond here and this peptide bond here. That generates three different fragments, a 2-residue fragment, and two 3-amino acid residue fragments. We can see those fragments down below here.
There are actually a lot of different types of peptidases. We're going to talk about all of these different peptidases in more detail as we move forward in our course. After we've generated all these peptide fragments from these cleavage techniques, we can separate all the peptide fragments using some kind of separation technique, such as HPLC. Once we've separated all the peptide fragments, we can subject each of these peptide fragments individually to peptide sequencing, sequencing the peptides, via techniques such as Edman degradation. Edman degradation requires that large proteins be cleaved down into fragments before they can actually be sequenced. The reason why we'll talk about in some of our later videos. But for now, what you need to know is that after we cleave our fragments down into our protein down into fragments, we'll separate the fragments, and then we'll subject them to sequencing. We can use the Edman degradation results essentially to obtain the end result, which is our sequenced orange peptide. You can see all the one-letter amino acid codes for those residues have been revealed, and we've successfully sequenced our protein.
This is really a map to the next set of lesson videos. We'll first start talking about FDNB moving forward, and then we'll talk about amino acid hydrolysis, followed by chemical cleavage, and then lastly, we'll end up with peptidases and Edman degradation. It's important to keep in mind that this is really just a map. This is where you are now, and this path here, going down is essentially the path that we're going to take as we move forward with our lesson videos. I would suggest you print off this map and have it side by side with you as we move forward in our lesson video, so you know exactly where we are in this map. This here, ends our initial lesson on the overview of direct protein sequencing, and we'll be able to get some practice in our next video. I'll see you there.