What is the isoelectric point (pI)?

The isoelectric point (pI) is the pH at which a protein's net charge is zero. At this pH the protein has minimum solubility and zero electrophoretic mobility. It is a critical parameter for designing purification processes and selecting chromatography conditions.

What does the Stokes-Einstein equation calculate?

The Stokes-Einstein equation D = kBT/(6πηRh) allows calculation of the translational diffusion coefficient D of a protein from temperature T, solvent viscosity η, and hydrodynamic radius Rh. It is used in DLS and analytical ultracentrifugation analysis.

How is the hydrodynamic radius Rh estimated?

For globular proteins, the empirical relation Rh ≈ 0.066 × MW^0.333 (nm) (MW in Da) is widely used. In denatured or aggregated states, the actual Rh is larger than this estimate.

In what unit is the sedimentation coefficient s expressed?

The sedimentation coefficient s is expressed in Svedberg (S) units, where 1 S = 10⁻¹³ s. This unit is also used in naming ribosomal subunits (e.g., 30S/50S). Typical protein values range from 1 to 50 S.

› Protein Properties & Molecular Weight Calculator Back

Biotechnology

Protein Properties & Molecular Weight Calculator

Real-time calculation of net charge, hydrodynamic radius Rh, Stokes-Einstein diffusion coefficient D, sedimentation coefficient s, and SDS-PAGE band estimate from molecular weight, pH, and temperature.

Protein Parameters

Input Mode

Amino Acid Residues:

Isoelectric Point pI:

Solution pH:

Temperature T:

°C

Ionic Strength:

Results

—

Mol. Weight (kDa)

—

Rh (nm)

—

D (μm²/s)

—

Sedimentation (S)

—

Net Charge (e)

—

Dialysis MWCO (kDa)

Net Charge vs pH Titration Curve

Main

Theory & Key Formulas

Stokes-Einstein:
$D = \frac{k_B T}{6\pi\eta R_h}$

Hydrodynamic radius (spherical approx.):
$R_h \approx 0.066 \cdot MW^{0.333}\text{ (nm)}$

Sedimentation coefficient:
$s = \frac{M(1-\bar{v}\rho)}{N_A f}$

What is Protein Characterization?

🙋

What exactly does this simulator calculate from just a protein sequence? It seems like magic!

🎓

Basically, it uses the amino acid sequence as a blueprint. For instance, by adding up the charges on each acidic and basic residue at a given pH, it calculates the protein's net charge. Try moving the pH slider above—you'll see the net charge change in real-time, which is crucial for techniques like ion-exchange chromatography.

🙋

Wait, really? So the "hydrodynamic radius" and "diffusion coefficient" are also predicted from the sequence? How?

🎓

In practice, we estimate the molecular weight (MW) from the sequence. Then, we use a power-law relationship to approximate the protein's effective size in solution, its hydrodynamic radius ($R_h$). A common case is a globular protein. When you change the input mode to "Manual MW" and enter a value, you can see how $R_h$ scales with MW.

🙋

That's cool. But what's the point of knowing the diffusion coefficient? Is that something scientists actually use?

🎓

Absolutely! The diffusion coefficient ($D$) tells you how fast a protein moves by Brownian motion. For example, in drug development, you need to know how quickly a therapeutic antibody diffuses to its target. The simulator uses the Stokes-Einstein equation to calculate $D$ from $R_h$. Adjust the temperature parameter and watch $D$ change—it's directly proportional to $T$.

Physical Model & Key Equations

The core model treats the protein as a sphere moving through a viscous fluid (the buffer). The hydrodynamic radius ($R_h$) is its effective spherical size. The fundamental relationship between size and diffusion is given by the Stokes-Einstein equation:

$$D = \frac{k_B T}{6\pi\eta R_h}$$

Where $D$ is the diffusion coefficient (m²/s), $k_B$ is Boltzmann's constant, $T$ is absolute temperature (K), $\eta$ is the solvent viscosity (Pa·s), and $R_h$ is the hydrodynamic radius (m). This equation shows that larger proteins (bigger $R_h$) diffuse more slowly.

Since we often only know the molecular weight (MW), we use an empirical scaling law to estimate $R_h$ for a typical globular, folded protein:

$$R_h \approx 0.066 \cdot MW^{0.333}\text{ (nm)}$$

Here, $MW$ is in Daltons (Da), and $R_h$ is in nanometers. The exponent 1/3 (or 0.333) comes from the volume scaling with mass. This is a simplification—unfolded or elongated proteins will have a larger $R_h$ for the same MW.

Real-World Applications

Chromatography Method Development: The calculated isoelectric point (pI) and net charge vs. pH profile are used to select the optimal pH and resin type for ion-exchange chromatography, a critical protein purification step in biopharmaceutical manufacturing.

Analytical Ultracentrifugation (AUC): The predicted sedimentation coefficient helps in designing AUC experiments and interpreting data to determine protein homogeneity, aggregation state, and binding interactions in solution.

SDS-PAGE Experiment Planning: The estimated apparent molecular weight on an SDS-PAGE gel allows researchers to select the correct percentage of polyacrylamide gel and to identify their protein band among standards, a daily task in molecular biology labs.

Drug Formulation & Stability: Understanding how diffusion and effective size change with temperature or buffer conditions is vital for formulating stable biologic drugs (like antibodies) and predicting their behavior during storage and delivery.

Common Misconceptions and Points to Note

Here are a few points that experimental researchers often stumble upon when starting to use this tool. First, understand that "the calculated pI is not an absolute value." The tool calculates using the "standard" pKa values for each amino acid. However, in actual proteins, the pKa of side chains can deviate significantly from these values due to influences from the local electric field or hydrophobic environment. For example, glutamic acid 35 in lysozyme has a pKa elevated to about 6 due to its unique environment. Therefore, it's not uncommon for calculated and measured values to differ by 0.5 to 1.0 pH units. While it's powerful as a guide, always confirm final experimental conditions with preliminary tests.

Next, remember that "the estimated SDS-PAGE position is just a guideline." This calculation assumes a standard globular protein. But real proteins can migrate slower due to post-translational modifications (like phosphorylation or glycosylation) or show anomalous migration, like membrane proteins. For instance, a heavily glycosylated protein with a calculated molecular weight of 70 kDa might show a band around 100 kDa on SDS-PAGE. Think of the tool's output as the theoretical position "if the protein behaved ideally in a denatured state."

Finally, make good use of the fact that "the hydrodynamic radius Rh reflects structural information." The tool calculates it from the molecular weight using a simple empirical formula $R_h \approx 0.066 \cdot MW^{1/3}$. If the Rh you measure experimentally (e.g., via dynamic light scattering) is significantly larger than this calculated value, it could be a sign that the protein is aggregated or has adopted an unfolded, expanded structure. Conversely, if it's smaller than the calculated value, the structure is likely very compact. Comparing calculated and measured values gives you qualitative information about your sample's state.

How to Use

Enter molecular weight in kDa (e.g., 66.5 for bovine serum albumin)
Input amino acid sequence length or use estimated length from MW (divide kDa by 0.110 for average residue mass)
Specify isoelectric point (pI) from literature or prediction tools; for human hemoglobin pI ≈ 6.8
Set experimental pH value (pH 7.4 for physiological buffer, pH 3.0 for acidic conditions)
Click calculate to obtain net charge, Stokes radius, diffusion coefficient (D), sedimentation coefficient (S), and predicted SDS-PAGE migration distance

Worked Example

Consider recombinant human insulin-like growth factor I (IGF-I): MW = 7.6 kDa, length = 70 amino acids, pI = 8.4, working pH = 7.4. At pH 7.4 (below pI 8.4), net charge is approximately +3.2. Stokes radius calculates to ~1.9 nm using Perrin formula. Diffusion coefficient D ≈ 8.2 × 10⁻⁷ cm²/s. Sedimentation coefficient S ≈ 0.64 S (Svedberg). On 16% SDS-PAGE with 4% stacking gel, expect migration to ~3–4 kDa position due to glycosylation reduction or disulfide reduction affecting mobility.

Practical Notes

Net charge varies sharply near pI; at pH = pI, charge ≈ 0 and protein precipitates (exploit for purification of alpha-amylase, pI 5.2, by acetone precipitation at pH 5.2)
Stokes radius assumes spherical hydrated protein; elongated proteins (collagen, fibrin) have larger radii and slower diffusion despite lower MW
SDS-PAGE migration depends on charge-to-mass ratio after detergent binding (~1.4 g SDS per gram protein); glycosylated proteins migrate anomalously higher
Sedimentation coefficients require density and viscosity corrections for non-aqueous buffers containing glycerol or sucrose