Skip to main content
The NCI Community Hub will be retiring in May 2024. For more information please visit the NCIHub Retirement Page:https://ncihub.cancer.gov/groups/ncihubshutdown/overview
close
  • Discoverability Visible
  • Join Policy Restricted
  • Created 22 Jan 2014

PSON0008 Proteomic Characterization of Cell Lines

Version 47
by (unknown)
Version 48
by (unknown)

Deletions or items before changed

Additions or items after changed

1 ''' Study Title ''' Proteomic Characterization of Cell Lines
2
3 This page: [https://nciphub.org/groups/nci_physci/wiki/PSON0008]
4 +
5 +
* Download the dataset at [ftp://caftpd.nci.nih.gov/psondcc/PhysicalCharacterization/Proteomics]
6 +
* The project's entry at Synapse.org [https://www.synapse.org/#!Synapse:syn9697791]
7
8 ''' Study Contact ''' Parag Mallick; email: paragm@stanford.edu
9
10 ''' OVERVIEW '''
11
12 The Janmey lab at the
13 University of Pennsylvania grew each of 9 cell lines from Physical Sciences-Oncology Network Bioresource Core Facility (PBCF)
14 [https://physics.cancer.gov/bioresources/] on each of seven growth matrices for a total of 63 different samples.
15 Approximately nine million cells were initially plated (across 6 - 9
16 plates) for each condition. Cells were grown for 24 hours. After 24
17 hours, cells were isolated, pelleted, flash frozen and then
18 transferred to Stanford for proteomics analysis. Proteins were extracted from the pellets.
19 Next, a small portion of this was precipitated and used for QC
20 analysis. Specifically, protein amount was estimated by !MicroBCA. A
21 Coomassie stained gel was also run to verify that there was no
22 significant contamination or degradation. Next, allocations for each
23 of the downstream assays were determined and protein aliquoted. Each
24 of these aliquots was precipitated and then directed to its subsequent
25 assay (iBAQ, TMT and Phospho), as detailed below.
26
27 ''' Figure 1 . Growth and transfer of cell line extracts'''
28
29 [[Image(Figure1.png, 400px)]]
30
31 ----
32
33 ''' iBAQ OVERVIEW '''
34
35 The goal iBAQ analysis was twofold. First, we wanted to generally
36 verify that the sample was of sufficient quality to support broader
37 proteomics analysis. Second, we wanted to generate data that might be
38 used for either spectral count or iBAQ analysis to support absolute
39 quantification.
40
41 ''' Sample Preparation and Mass Spectrometric Analysis Workflow '''
42
43 Our overall iBAQ process begins with the precipitated protein generated as described above. This precipitate was re-suspended and then split into three aliquots. To facilitate iBAQ analysis, UPS2, a standard protein mixture (Sigma) was spiked into one of the aliquots. We denote this aliquot plusUPS2. As downstream analysis of these aliquots was performed independently they serve as sample preparation triplicates, which we refer to as preplicates. As shown in Figure 2, each preplicate was processed using our iBAQ protocol to generate tryptic peptides. Duplicate injections of these peptides were analyzed by a Thermo LTQ Orbitrap Velos generating a family of approximately 6 raw files per condition.
44
45 ''' Figure 2. iBAQ Sample Preparation Overview '''
46 [[Image(Fig3.png, 700px)]]
47
48
49
50 ''' iBAQ ANALYSIS '''
51
52 Raw instrument files (*raw) are converted to an intermediary file (mzXML format) using !ProteoWizard msConvert [http://proteowizard.sourceforge.net]. Next, files are uploaded to Labkey Server (http://www.labkey.org). Labkey server uses a typical Transproteomic Pipeline Process to match MS data to peptides and proteins from the Human2015.fasta FASTA Database. Results are stored as open-format pepXML and protXML files. We have additionally used Labkey server to convert these results into a simple .tsv file.
53
54 ''' Figure 3. iBAQ Data Analysis Overview '''
55
56 [[Image(Fig4.png, 500px)]]
57
58 ----
59
60 '''PHOSPHO OVERVIEW'''
61
62 Our overall phosphopeptide process begins with the precipitated protein generated as described above. This precipitate was re-suspended and then split into aliquots. 0-3 aliquots were generated depending upon sample amount. As above, downstream analysis of these aliquots was performed independently. Consequently, they serve as sample preparation replicates (when possible), which we refer to as preplicates. Each preplicate was processed using our phospho-enrichment protocol to generate an enriched pool of phosphorylated tryptic peptides. Briefly, between 250 and 750ug of protein is digested with trypsin into peptides. These peptides are then run through a !TiO2 column to enrich for phosphopeptides and then a graphite column to clean and desalt the peptides. Enriched phosphopeptides were analyzed by a Thermo Orbitrap Fusion to generate a family of up to 3 preplicate raw files per condition.
63
64 ''' Figure 4. Phospho Sample Preparation Overview'''
65 [[Image(Fig5.png, 700px)]]
66
67
68 ''' PHOSPHO ANALYSIS '''
69
70 Phospho data was analyzed by Proteome Discoverer software [http://www.thermoscientific.com/en/product/proteome-discoverer-software.html] to match MS data to peptides and proteins from the Uniprot_HUM_031914_cRAP.fasta FASTA file. Proteome discoverer produces an .msf file which is converted to an easily readable tsv file. The .msf (Magellan Storage File) extension files are SQLite files generated by the Proteome Discoverer. The .msf format is another file format (like pep.xml, prot.xml) to capture all the relevant output information from the Proteome Discoverer search results.
71
72 ''' Figure 5. Phsopho Analysis Overview '''
73
74 [[Image(Fig6.png, 500px)]]
75
76 ----
77 '''TMT OVERVIEW'''
78
79 The goal of Tandem-Mass-Tag (TMT) analysis is to provide deep relative quantification across conditions. Unlike the phospho and iBAQ studies where there are RAW files for each condition, TMT data spans multiple substrates.
80
81 ''' TMT Sample Preparation and Mass Spectrometric Analysis Workflow '''
82
83 Our overall TMT workflow begins with the precipitated protein generated as described above. The precipitate for each condition was re-suspended and then split into 3 preplicate aliquots. For each preplicate, the 7 substrate conditions are digested into peptides and then labeled with a TMT reagent. For example, in the figure below, substrate S1, is labeled with TMT-127; substrate S2 is labeled with TMT-128, etc.
84
85 Next, the 7 vials of labeled peptides are combined with a reference lysate (a pool of 4 cell lines) and two internal replicates to make a TMT-10-plex mixture. This 10-plex mixture is fractionated to achieve greater depth. The TMT-10-plex is fractionated by high-PH reverse phase fractionation into 8 reverse-phase fractions (RP Fractions). These 8 fractions are subsequently re-combined into 3 fractions. Fractions were re-combined for cost and time reasons. Recombination was done as follows: RP fractions 1,4 and 7 were pooled to make one pooled sample; RP fractions 2,5,8 were pooled to make the second sample; RP fractions 3 and 6 were pooled to make the third pooled sample. These samples were analyzed by a Thermo Orbitrap Fusion to generate three raw files per cell line preplicate.
86
87 ''' Figure 6. TMT Sample Preparation Overview'''
88
89 [[Image(Fig7.png, 500px)]]
90
91 ''' TMT ANALYSIS '''
92
93 TMT data was analyzed by Proteome Discoverer software (Link) to match MS data to peptides and proteins from the Uniprot_HUM_031914_cRAP.fasta FASTA
94 Database. Proteome discoverer produces an .msf file which is converted to an easily readable tsv file. The .msf (Magellan Storage File) extension files are SQLite files generated by the Proteome Discoverer. The .msf format is another file format (like pep.xml, prot.xml) to capture all the relevant output information from the Proteome Discoverer search results.
95
96 ''' Figure 7. TMT Analysis Overview '''
97
98 [[Image(Fig8.png, 500px)]]
99
100 '''OTHER REFERENCES'''
101
102 Proteome Discoverer User Guide
103 [https://tools.thermofisher.com/content/sfs/manuals/Man-XCALI-97506-Proteome-Discoverer-14-User-ManXCALI97506-A-EN.pdf]
104
105 Quantification with Proteome Discoverer
106 [https://tools.thermofisher.com/content/sfs/manuals/Quantification-with-Proteome-Discoverer-1-2.pdf]
107
108 Descriptions of proteome discoverer column results for phospho samples can be found at:
109 [https://sites.psu.edu/msproteomics/tag/psm/]
110
111
112
113 '''Data usage policy'''
114
115 The data contained within the PS-ON DCC is based on several research projects and is intended to be rapidly and constantly updated for the research community to access and use. The NCI requests that any data users:
116
117 Inform the data submitters about the intention to submit a publication that uses PS-ON DCC data. Include the following statement in any publications resulting from the use of PS-ON DCC data: ''Data used in this publication were generated by projects sponsored by the NCI Physical Sciences in Oncology Initiative.''