Protein charge and isoelectric point (pI) – “basic” vs “acidic” etc..Whether, in which direction, and to what extent, a protein is charged depends on “how it’s spelled” (what protein letters (amino acids) does it have in what order?) and what the pH is. And the 2 are directly related – pH is a measure of how many protons (H⁺) are in a solution – it’s an inverse log, so the more H⁺ (more acidic), the lower the pH. Some amino acids are able to give protons (act as acids) or take protons, (act as bases) depending on the proton availability, and since those protons are positively-charged, give/take-ing them changes the protein’s charge. Gaining a proton (protonation) can positivize a neutral or neutralize a negative whereas losing a proton can negativize a neutral or neutralize a positive. We can predict the overall charge of a protein using its isoelectric point (pI), which is the pH at which a protein is overall neutral. More “basic” proteins will have a higher pI and tend to be positively-charged and more “acidic” proteins have a lower pI and tend to be negatively charged (although the charge depends on the pH! so we’re usually talking about at physiological (bodily) pH of ~7.4). Here’s why…

note: the text is adapted from past posts. those posts have more on ion exchange chromatography, but I wanted to focus more on protein charge in depth today. & figures: you can find that post here:

I like to visually think of solutions as being a sort of “proton cookie jar” and the pH reflects the fullness of that jar. The fuller the jar (more free proton “cookies” floating around) the lower the pH and vice versa. Acids and bases are like friends donating (in the case of acids) or taking (in the case of bases) proton “cookies,” so the number of acids & bases in a solution influences the pH. But not all acids and bases are created equal – some acids are more generous and some bases are more greedy. ⠀

Proteins are like a big linked-up group of friends where some of them are cookie givers or takers. The more “proton-greedy” a protein is, the more positively-charged it will be, and it will remain charged even when there’s not a huge excess of protons (super acidic solution)(cookie jar not very full). Conversely, a “proton-generous” protein will tend to be negatively-charged unless the pH gets really low (cookie jar overflowing).⠀

No matter how greedy or generous your protein is, there’s some “tipping point” where the protein is neutral overall. I stress “overall” because the individual letters are the places where the charges are coming and going – and they act “independently” – so the overall charge of the protein is the sum of all those pluses and minuses. When the number of pluses = the number of minuses the protein has no overall charge (even if parts of it remain charged) and we call the pH at which this global neutrality is reached the isoelectric point, pI. ⠀

Interactions between charged chains & water contribute to keeping proteins dissolved. & like charges repel so when protein molecules are highly charged the same say they can repel each other, so usually a protein is the LEAST soluble at its pI, but this isn’t always the case.⠀

For any protein you know the sequence of, you can calculate the theoretical pI using a pI predictor (free online software like ExPASY ProtParam can do this for you). What it gives you is just an estimate of the pI, since the letters can be locally influenced by their neighbors. So the pI its algorithm spits out is a “best guess” about how proton-greedy your protein is. It tells you the “theoretical pI” – above this pH, there aren’t enough protons for all the takers to take them*, so your protein is likely negatively charged (anionic, and thus suitable for anion exchange chromatography). Below the pI, there are plenty of protons for even the reluctant takers to take them, so your protein is likely positively-charged (cationic, and thus you would choose to use cation chromatography). ⠀

*note: it’s not really that there are no free protons – there always are lots of them, but if you think about protons coming and going frequently, when there are fewer protons around it’s harder to grab another one if you let go. So I don’t want to confuse anyone with my oversimplified analogy, but I personally find it helpful when trying to figure out whether something will be charged or not⠀

If you need a review of pH, check out:

But here’s the gist and how it relates to protein charge. Molecules are groups of atoms held together by strong, “covalent” bonds – the type of bond that doesn’t break easily. Those atoms (individual hydrogens, carbons, oxygens, etc.) are made up of even smaller parts called “subatomic particles” which include protons (which are positively-charged), neutrons (which are neutral), and electrons (which are negatively-charged). ⠀

Even though they’re attached by covalent bonds (strong bonds that involve the sharing of pairs of electrons), bonds to H are weaker than bonds to other atoms. H’s are really small & don’t offer much latchbility so they’re relatively easily removed. Just how easily depends on what they’re attached to. Hydrogen atoms only have 1 proton & 1 electron & they often leave that electron behind when they go, so all they’re left with is a H⁺, so we often call H⁺ a proton.⠀

These protons can come from water itself, which is mostly H₂O, but also contains O ⁻ & H⁺. Or they can come from other molecules, including proteins. Protons can easily find a water to latch onto (forming hydronium ion (H₃O⁺) because the vast vast majority of water is in water form (not ionized). The ions are spread out throughout the solution, but for the sake of visualization you can think of all the protons as being in a jar, and for the sake of fun analogy-ness you can think of these protons as cookies! So, water is just cookies waiting to be made!⠀

You can think of a solution kinda like a middle-school cafeteria with a cookie jar. pH is a description of how full the jar is and it is a property of the cafeteria as a whole – it comes from contributions from all the students (molecules), some of whom share cookies and some of whom take them, and proteins are like “cliques” of students whose contributions we can view at the individual student (amino acid) or clique (protein) levels. ⠀

Like any molecules, proteins are made up of atoms (of elements like hydrogen (H), carbon (C), oxygen (O), nitrogen (N), & sulfur (S)) joined together through covalent bonds (strong bonds that involve atoms sharing pairs of electrons). In any solution you have a fixed # of each type of atom – that is, you can’t do alchemy & voila suddenly have gold. You can’t change the total # of hydrogens, but you can change how they’re distributed.⠀

The atoms in proteins are arranged into amino acid building blocks that are linked up to form proteins. You can think of the individual amino acids as “students” that link up to from large “cliques.” There are 20 (common) amino acids, & proteins can be made up of “any” # of any combination of them (e.g. you can have 15 “Janes” in one clique).⠀

We can talk about “cookie-having” at the clique (protein) level or the friend (amino acid level). At the clique (protein) level we talk pI (isoelectric point) and it comes from what’s happening at the friend (amino acid) level, which we measure as pKa.⠀

Some amino acids have more H⁺s than other amino acids. If they “look at the jar” & see that there are plenty of cookies, they say “why should I give up my cookie if there are already cookies out there?” So they keep it. Others would like a cookie but they have better “self-control” – not until the jar’s pretty full will they take some for themselves.⠀

Different amino acid “friends” have different levels of greediness, which we describe using a # called pKa. The pKa is the pH at which 1/2 of a thing will be protonated & half won’t be. Below the pKa, there are more free H⁺ (fuller cookie jar), so the thing’s more likely to be protonated. Above the pKa there are less free H⁺ (less cookies in the jar) so the thing’s more likely to be deprotonated. ⠀

But not all things are protonatable/deprotonatable. They have to have an H for one thing that H can’t be too tightly held. Each location that is protonatable/deprotonatable has its own pKa. So let’s look where on proteins we can find these sites.⠀

Amino acids all have the same generic backbone that they use to link together & then they have unique “side chains” aka “side groups” aka “R groups” The generic backbone part has an amino (-NH₂) group on 1 end & a carboxyl -(C=O)-OH group on the other end. They link up amino to carboxyl in an amide aka peptide bond —NH-(C=O)— , shed a water in the process & the part that’s left we often refer to as the “residue” – so when alanine is by itself, we call it a free amino acid. But when it’s in a chain (peptide) we call it a residue.⠀

When you have free amino acids, they all have at least 2 groups that can give or take hydrogens. The amino end can exist as -NH₂ (neutral) or -NH₃⁺ (+ charged) & the carboxyl group can exist as -(C=O)-OH (neutral) or -(C=O)-O⁻ (- charged). So a free amino acid can be +, -, or neutral. The pKas of these groups depend on the amino acid but are typically ~10 & ~2 respectively.⠀

Physiological pH (the pH inside your body) is about 7.4, which is above 2, so most of the carboxyl groups are protonated & thus in the negatively-charged carboxylate form. But below 10, so most of the amino ends are in the +-charged NH₃⁺ form. So the charges cancel out – we call this a zwitterion

When you link amino acids together, you’re only left with 1 amino end & one carboxyl end. So you only have to take those group’s protonation state into account once. But some of the side chains in the residues in between those ends can get protonated or deprotonated as well & when we talk about pKa’s for amino acids in the context of proteins or peptides, we usually are talking about the pKa of the side chain (if there is one). Sometimes this is referred to as the pKR or pK3.⠀

There are 2 side chains that are frequently deprotonated at physiological pH – we call these acidic because they donate H⁺s & when they do they become negatively charged & now capable of accepting H⁺s (acting as a base) so we call them “conjugate bases.” It can seem kinda confusing because “acidic” residues often play important roles by acting as bases in their deprotonated form. The “acidic” refers to its neutral form being acidic.⠀

These side chains are: Aspartic acid (Asp, D)(pKa ~3.65), which gives up a H⁺ to become aspartate & Glutamic acid (Glu, E)(pKa ~4.25) which gives up a H⁺ to become glutamate. Cysteine (Cys, C) & tyrosine (Tyr, Y) can can also deprotonate to give negatively charged chains, but they do so much less readily – pKa of 8.4 for Cys & 10.5 for Tyr.⠀

There are 3 side chains that are frequently protonated at physiological pH – we call these “basic.” Lysine (Lys, K) (pKa ~ 10.28) & Arginine (Arg, R) (pKa ~13.2) are predominantly protonated at cellular pH, but Histidine (His, H) (5.97) is more “iffy” (remember that pKa tells you when HALF the groups are deprotonated on average so it’s not like you hit the pKa & bam they’re all deprotonated – you have a mix. Also, an important caveat is that pKas are context-dependent so the pKa you get from a table is likely close to but not exactly the “real” pKa in the situation you’re looking at.⠀

So this is all happening at the “friend” level, but what if we want to know about the clique’s “cookie-ness”? We need to take into account that clique’s unique combination of friends -> We calculate the pI (isoelectric point), that pH at which the protein is neutral overall. It doesn’t mean that each individual amino acid is neutral, just that the non-neutral ones balance out.⠀

So proteins with high pIs are like cookie-greedy cliques. You have to get them to see that the jar’s really low & others are desperately cookie-deficient before they’ll donate (at the net level)⠀

The higher the pI, the more H⁺-rich the protein is likely to be -> you have to take it to a higher pH (more basic meaning less free H⁺) before it will shed H⁺s. We often refer to such proteins that are + charged at neutral pH as “basic” & they get that “basicity” because they have lots of basic residues (His, Lys, &/or Arg). They’re great for binding negatively-charged things like DNA or RNA.⠀

Proteins with a low pI are like generous cliques. They don’t need cookies to be happy and will only take it once they know there’s plenty for everyone!⠀

Proteins with pI’s below neutral are “acidic” and they usually have lots of Glu’s and Asp’s.⠀

in summary:

  • pH is a measure of free protons (H⁺) which are positively charged, but pH is on a negative log scale so LOWER pH means more positively-charged protons floating around
  • some parts of proteins can grab onto them – the more there are (lower the pH) the more likely this is → increases charge (an make neutral side chains ➕ charged or ➖ charged side chains neutral)
  • when pH is high, kiss those H⁺ goodbye! at high pH there aren’t many protons so this is less likely and some protein parts can actually donate them to the cause → decreases charge (➖ side chains can’t get neutralized (they stay ➖)). And ➕ side chains have to “give up” their H⁺ (they get neutralized)
  • proteins have different combinations of parts that give & parts that take
  • as a result, a protein has an overall charge that depends on the pH

Each protein has a specific point at which it at which the protein overall is neutral – there can be ➕ & ➖ charged chains, but they perfectly cancel each other out & we call this charged but neutral condition one of my favorite words: zwitterionic. The pH at which it occurs the pI or isoelectric point

  • Above the pI, the protein is ➖
  • Below the pI, the protein is ➕⠀

more on buffers and the Henderson-Hasselbalch equation: 

Leave a Reply

Your email address will not be published.