Identifying Essential Proteins in Dynamic PPI Network with Improved FOA

Xiujuan Lei, Siguo Wang, Linqiang Pan

Abstract


Identification of essential proteins plays an important role for understanding the cellular life activity and development in postgenomic era. Identification of essential proteins from the protein-protein interaction (PPI) networks has become a hot topic in recent years. In this work, fruit fly optimization algorithm (FOA) is extended for identifying essential proteins, the extended algorithm is called EPFOA, which merges FOA with topological properties and biological information for essential proteins identification. The algorithm EPFOA has the advantage of identifying multiple essential proteins simultaneously rather than completely relying on ranking score identification individually. The performance of EPFOA is analyzed on dynamic PPI networks, which are constructed by combining the gene expression data. The experimental results demonstrate that EPFOA is more efficient in detecting essential proteins than the state-of-the-art essential proteins detection methods.

Keywords


essential proteins, protein-protein interaction (PPI), dynamic PPI networks, subcellular localization data, fruit fly optimization algorithm (FOA)

Full Text:

PDF

References


Binder, J. X., Pletscher-Frankild, S., Tsafou, K., Stolte, C., O'Donoghue, S. I., Schneider, R., Jensen, L. J. (2014); COMPARTMENTS: Unification and Visualization of Protein Subcellular Localization Evidence, Database, bau012, 2014.

Bocu, R., Tabirca, S. (2011); The Flag-based Algorithm - A Novel Greedy Method that Optimizes Protein Communities Detection, International Journal of Computers Communications & Control, 6(1), 33-44, 2011.
https://doi.org/10.15837/ijccc.2011.1.2198

Bonacich, P. (1987); Power and Centrality: A Family of Measures, American Journal of Sociology, 92(5), 1170-1182, 1987.
https://doi.org/10.1086/228631

Cherry, J. M., Adler, C., Ball, C., Chervitz, S. A., Dwight, S. S., Hester, E. T., Schroeder, M. (1998); SGD: Saccharomyces Genome Database, Nucleic Acids Research, 26(1), 73, 1998.
https://doi.org/10.1093/nar/26.1.73

Consortium, G. O. (2015); Gene Ontology Consortium: Going Forward, Nucleic Acids Research, 43 (Database issue), 1049-1056, 2015.

Consortium, G. O., Blake, J. A., Dolan, M., Drabkin, H., Hill, D. P., Li, N., Buza, T. (2013); Gene Ontology Annotations and Resources, Nucleic Acids Research, 41(D1), 530-535, 2013.

Cullen, L. M., Arndt, G. M. (2005); Genome-Wide Screening for Gene Function Using RNAi in Mammalian Cells, Immunology Cell Biology, 83(3), 217-223, 2005.
https://doi.org/10.1111/j.1440-1711.2005.01332.x

Dzitac, I. (2015); Impact of Membrane Computing and P Systems in ISI WoS. Celebrating the 65th Birthday of Gheorghe Paun, International Journal of Computers Communications & Control, 10(5), 617–626, 2015.
https://doi.org/10.15837/ijccc.2015.5.2024

Estrada, E., Rodriguez-Velázquez, J. A. (2005); Subgraph Centrality in Complex Networks, Physical Review E Statistical Nonlinear Soft Matter Physics, 71(2), 056103, 2005.

Gavin, A. C., Aloy, P., Grandi, P., Krause, R., Boesche, M., Marzioch, M., Dampelfeld, B. (2006); Proteome Survey Reveals Modularity of The Yeast Cell Machinery, Nature, 440(7084), 631-636, 2006.
https://doi.org/10.1038/nature04532

Giaever, G., Chu, A. M., Ni, L., Connelly, C., Riles, L., Véronneau, S., André, B. (2002); Functional Profiling of the Saccharomyces Cerevisiae Genome, Nature, 418(6896), 387, 2002.
https://doi.org/10.1038/nature00935

Gill, N., Singh, S., Aseri, T. C. (2014); Computational Disease Gene Prioritization: An Appraisal, Journal of Computational Biology A Journal of Computational Molecular Cell Biology, 21(6), 456-465, 2014.

Hsing, M., Byler, K. G.,Cherkasov, A. (2008); The Use of Gene Ontology Terms for Predicting Highly-Connected 'Hub' Nodes in Protein-Protein Interaction Networks, BMC Systems Biology, 2(1), 1-14, 2008.

Jeong, H., Mason, S. P., Barabási, A. L., Oltvai, Z. N. (2001); Lethality and Centrality in Protein Networks, Nature, 411(6833), 41-42, 2001.
https://doi.org/10.1038/35075138

Jimenezsanchez, G., Childs, B., Valle, D. (2001); Human Disease Genes, Nature, 409(6822), 853-855, 2001.
https://doi.org/10.1038/35057050

Lei, X., Wang, F., Wu, F. X., Zhang, A., Pedrycz, W. (2016); Protein Complex Identification Through Markov Clustering with Firefly Algorithm on Dynamic Protein-Protein Interaction Networks, Information Sciences, 329(6), 303-316, 2016.

Lei, X., Wang, S., Pan, L. (2017); Predicting Essential Proteins Based on Gene Expression Data, Subcellular Localization and PPI Data. Bio-inspired Computing: Theories and Applications: 12th International Conference, Proceedings of, 92-105, 2017.
https://doi.org/10.1007/978-981-10-7179-9_8

Li, M., Lu, Y., Wang, J., Wu, F. X., Pan, Y. (2015); A Topology Potential-Based Method for Identifying Essential Proteins from PPI Networks, IEEE/ACM Transactions on Computational Biology Bioinformatics, 12(2), 372, 2015.
https://doi.org/10.1109/TCBB.2014.2361350

Li, M., Wang, J., Chen, X., Wang, H., Pan, Y. (2011); A Local Average Connectivity-Based Method for Identifying Essential Proteins from the Network Level, Computational Biology Chemistry, 35(3), 143-150, 2011.
https://doi.org/10.1016/j.compbiolchem.2011.04.002

Li, M., Wang, J., Wang, H., Pan, Y. (2012); Identification of Essential Proteins Based on Edge Clustering Coefficient, IEEE/ACM Transactions on Computational Biology Bioinformatics, 9(4), 1070, 2012.
https://doi.org/10.1109/TCBB.2011.147

Li, M., Zhang, H., Wang, J. X., Pan, Y. (2012); A New Essential Protein Discovery Method Based on the Integration of Protein-Protein Interaction and Gene Expression Data, BMC Systems Biology, 6(1), 15, 2012.
https://doi.org/10.1186/1752-0509-6-15

Luo, J., Kuang, L. (2014); A New Method for Predicting Essential Proteins Based on Dynamic Network Topology and Complex Information, Computational Biology Chemistry, 52(C), 34, 2014.
https://doi.org/10.1016/j.compbiolchem.2014.08.022

Mewes, H. W., Frishman, D., Mayer, K. F. X., Münsterkötter, M., Noubibou, O., Pagel, P., St¨šmpflen, V. (2006); MIPS: Analysis and Annotation of Proteins from Whole Genomes in 2005, Nucleic Acids Research, 34 (Database issue), 169-172, 2006.

Newman, M. E. J. (2005); A Measure of Betweenness Centrality Based on Random Walks, Social Networks, 27(1), 39-54, 2005.
https://doi.org/10.1016/j.socnet.2004.11.009

Pan, W. T. (2012); A New Fruit Fly Optimization Algorithm: Taking the Financial Distress Model as an Example, Knowledge-Based Systems, 26(2), 69-74, 2012.
https://doi.org/10.1016/j.knosys.2011.07.001

Pan, L., Paun, Gh. (2009); Spiking Neural P Systems with Anti-Spikes. International Journal of Computers Communications & Control, 4(3), 273–282, 2009.
https://doi.org/10.15837/ijccc.2009.3.2435

Pál, C., Papp, B., Hurst, L. D. (2003); Genomic function: Rate of Evolution and Gene Dispensability, Nature, 421(6922), 496-497, 2003.
https://doi.org/10.1038/421496b

Paun, Gh. (2000); Computing with Membranes, Journal of Computer and System Sciences, 61(1), 108–143, 2000.
https://doi.org/10.1006/jcss.1999.1693

Paun, Gh. (2016); Membrane Computing and Economics: A General View, International Journal of Computers Communications & Control, 11(1), 105–112, 2016.
https://doi.org/10.15837/ijccc.2016.1.2160

Peng, W., Wang, J., Cheng, Y., Lu, Y., Wu, F., Pan, Y. (2015); UDoNC: An Algorithm for Identifying Essential Proteins Based on Protein Domains and Protein-Protein Interaction Networks, Computational Biology Bioinformatics IEEE/ACM Transactions on, 12(2), 276- 288, 2015.

Przytycka, T. M., Singh, M., Slonim, D. K. (2010); Toward the Dynamic Interactome: It's about Time, Briefings in Bioinformatics, 11(1), 15-29, 2010.
https://doi.org/10.1093/bib/bbp057

Qin, C., Sun, Y., Dong, Y. (2017); A New Computational Strategy for Identifying Essential Proteins Based on Network Topological Properties and Biological Information, PLoS ONE, 12(7), e0182031, 2017.
https://doi.org/10.1371/journal.pone.0182031

Radicchi, F., Castellano, C., Cecconi, F., Loreto, V., Parisi, D. (2004); Defining and Identifying Communities in Networks, Proceedings of the National Academy of Sciences of the United States of America, 101, 2658-2663, 2004.
https://doi.org/10.1073/pnas.0400054101

Ren, J., Wang, J., Li, M., Wang, H., Liu, B. (2011); Prediction of Essential Proteins by Integration of PPI Network Topology and Protein Complexes. Information Bioinformatics Research and Applications - International Symposium, Isbra 2011, Changsha, China, May 27-29, 2011. Proceedings of, 12-24, 2011.
https://doi.org/10.1007/978-3-642-21260-4_6

Roemer, T., Jiang, B., Davison, J., Ketela, T., Veillette, K., Breton, A., Marta, C. (2003); Large-Scale Essential Gene Identification in Candida Albicans and Applications to Antifungal Drug Discovery, Molecular Microbiology, 50(1), 167-181, 2003.
https://doi.org/10.1046/j.1365-2958.2003.03697.x

Schlicker, A., Lengauer, T., Albrecht, M. (2010); Improving Disease Gene Prioritization Using the Semantic Similarity of Gene Ontology Terms, Bioinformatics, 26(18), i561, 2010.
https://doi.org/10.1093/bioinformatics/btq384

Song, B., Pan, L., Pérez-Jiménez, M. J. (2016); Cell-Like P Systems with Channel States and Symport/Antiport Rules, IEEE Transactions on NanoBioscience, 15(6), 555–566, 2016.
https://doi.org/10.1109/TNB.2016.2594192

Song, B., Song, T., Pan, L. (2017); A Time-Free Uniform Solution to Subset Sum Problem by Tissue P Systems with Cell Division, Mathematical Structures in Computer Science, 27(1), 17–32, 2017.
https://doi.org/10.1017/S0960129515000018

Song, B., Zhang, C., Pan, L. (2017); Tissue-Like P Systems with Evolutional Symport/Antiport Rules, Information Sciences, 378, 177–193, 2017.
https://doi.org/10.1016/j.ins.2016.10.046

Stephenson, K., Zelen, M. (1989); Rethinking centrality: Methods and Examples, Social Networks, 11(1), 1-37, 1989.
https://doi.org/10.1016/0378-8733(89)90016-6

Tang, X., Wang, J., Zhong, J., Pan, Y. (2014); Predicting Essential Proteins Based on Weighted Degree Centrality, IEEE/ACM Transactions on Computational Biology Bioinformatics, 11(2), 407-418, 2014.
https://doi.org/10.1109/TCBB.2013.2295318

Tang, X. W. (2017); Predicting Essential Proteins Using a New Method, Intelligent Computing Theories and Application: 13th International Conference, ICIC 2017, Liverpool, UK, August 7-10, Proceedings of, Part II, 301-308, 2017.

Tang, Y., Li, M., Wang, J., Pan, Y., Wu, F. X. (2015); CytoNCA: A Cytoscape Plugin for Centrality Analysis and Evaluation of Protein Interaction Networks, BioSystems, 127, 67-72, 2015.
https://doi.org/10.1016/j.biosystems.2014.11.005

Tu, B. P., Mcknight, S. L. (2005); Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes, Science, 310(5751), 115, 2005.

Wang, J., Peng, X., Li, M., Luo, Y., Pan, Y. (2011); Active Protein Interaction Network and Its Application on Protein Complex Detection, IEEE International Conference on Bioinformatics and Biomedicine, 37-42, 2011.

Wang, J., Peng, X., Peng, W., Wu, F. X. (2014); Dynamic Protein Interaction Network Construction and Applications, Proteomics, 14(4-5), 338-352, 2014.
https://doi.org/10.1002/pmic.201300257

Wang, J. Z., Du, Z., Payattakool, R., Yu, P. S., Chen, C. F. (2007); A New Method to Measure the Semantic Similarity of GO Terms, Bioinformatics, 23(10), 1274, 2007.
https://doi.org/10.1093/bioinformatics/btm087

Wang, L., Zheng, X. L., Wang, S. Y. (2013); A Novel Binary Fruit Fly Optimization Algorithm for Solving The Multidimensional Knapsack Problem, Knowledge-Based Systems, 48(2), 17-23, 2013.

Watts, D. J., Strogatz, S. H. (1998); Collective Dynamics of 'Small-World' Networks, Nature, 393(6684), 440, 1998.
https://doi.org/10.1038/30918

Winzeler, E. A., Shoemaker, D. D., Astromoff, A., Liang, H., Anderson, K., Andre, B., Bussey, H. (1999); Functional Characterization of the S. cerevisiae Genome by Gene Deletion and Parallel Analysis, Science, 285(5429), 901-906, 1999.
https://doi.org/10.1126/science.285.5429.901

Wuchty, S. (2001); Scale-Free Behavior in Protein Domain Networks, Molecular Biology Evolution, 18(9), 1694, 2001.
https://doi.org/10.1093/oxfordjournals.molbev.a003957

Wuchty, S., Stadler, P. F. (2003); Centers of Complex Networks, Journal of Theoretical Biology, 223(1), 45, 2003.
https://doi.org/10.1016/S0022-5193(03)00071-7

Yan, W., Sun, H., Wei, D., Enrico, B., Gabriella, V., Ying, X., Liang, Y. (2014); Identification of Essential Proteins Based on Ranking Edge-Weights in Protein-Protein Interaction Networks, PLoS ONE, 9(9), e108716, 2014.
https://doi.org/10.1371/journal.pone.0108716

Zeng, X., Lin, W., Guo, M., Zou, Q. (2017). A comprehensive overview and evaluation of circular RNA detection tools, PLoS Computational Biology, 13(6), e1005420, 2017.
https://doi.org/10.1371/journal.pcbi.1005420

Zhang, R., Lin, Y. (2009); DEG 5.0, A Database of Essential Genes in both Prokaryotes and Eukaryotes, Nucleic Acids Research, 37 (Database issue), D455, 2009.
https://doi.org/10.1093/nar/gkn858

Zhang, X. F., Dai, D. Q., Ouyang, L., Yan, H. (2014); Detecting Overlapping Protein Complexes Based on a Generative Model with Functional and Topological Properties, BMC Bioinformatics, 15(1), 186, 2014.
https://doi.org/10.1186/1471-2105-15-186

Zhang, Y., Lin, H., Yang, Z., Wang, J. (2013); Construction of Ontology Augmented Networks for Protein Complex Prediction, PLoS ONE, 8(5), : e62077, 2013.
https://doi.org/10.1371/journal.pone.0062077

Zhao, B., Wang, J., Li, M., Wu, F. X., Pan, Y. (2014); Detecting Protein Complexes Based on Uncertain Graph Model, IEEE/ACM Transactions on Computational Biology Bioinformatics, 11(3), 486-497, 2014.
https://doi.org/10.1109/TCBB.2013.2297915

Zhu, C., Wu, C., Aronow, B. J., Jegga, A. G. (2014); Computational Approaches for Human Disease Gene Prediction and Ranking, Advances in Experimental Medicine Biology, 799, 69, 2014.
https://doi.org/10.1007/978-1-4614-8778-4_4




DOI: https://doi.org/10.15837/ijccc.2018.3.3285



Copyright (c) 2018 Xiujuan Lei, Siguo Wang, Linqiang Pan

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

CC-BY-NC  License for Website User

Articles published in IJCCC user license are protected by copyright.

Users can access, download, copy, translate the IJCCC articles for non-commercial purposes provided that users, but cannot redistribute, display or adapt:

  • Cite the article using an appropriate bibliographic citation: author(s), article title, journal, volume, issue, page numbers, year of publication, DOI, and the link to the definitive published version on IJCCC website;
  • Maintain the integrity of the IJCCC article;
  • Retain the copyright notices and links to these terms and conditions so it is clear to other users what can and what cannot be done with the  article;
  • Ensure that, for any content in the IJCCC article that is identified as belonging to a third party, any re-use complies with the copyright policies of that third party;
  • Any translations must prominently display the statement: "This is an unofficial translation of an article that appeared in IJCCC. Agora University  has not endorsed this translation."

This is a non commercial license where the use of published articles for commercial purposes is forbiden. 

Commercial purposes include: 

  • Copying or downloading IJCCC articles, or linking to such postings, for further redistribution, sale or licensing, for a fee;
  • Copying, downloading or posting by a site or service that incorporates advertising with such content;
  • The inclusion or incorporation of article content in other works or services (other than normal quotations with an appropriate citation) that is then available for sale or licensing, for a fee;
  • Use of IJCCC articles or article content (other than normal quotations with appropriate citation) by for-profit organizations for promotional purposes, whether for a fee or otherwise;
  • Use for the purposes of monetary reward by means of sale, resale, license, loan, transfer or other form of commercial exploitation;

    The licensor cannot revoke these freedoms as long as you follow the license terms.

[End of CC-BY-NC  License for Website User]


INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL (IJCCC), With Emphasis on the Integration of Three Technologies (C & C & C),  ISSN 1841-9836.

IJCCC was founded in 2006,  at Agora University, by  Ioan DZITAC (Editor-in-Chief),  Florin Gheorghe FILIP (Editor-in-Chief), and  Misu-Jan MANOLESCU (Managing Editor).

Ethics: This journal is a member of, and subscribes to the principles of, the Committee on Publication Ethics (COPE).

Ioan  DZITAC (Editor-in-Chief) at COPE European Seminar, Bruxelles, 2015:

IJCCC is covered/indexed/abstracted in Science Citation Index Expanded (since vol.1(S),  2006); JCR2016: IF=1.374. .

IJCCC is indexed in Scopus from 2008 (SNIP2016 = 0.701, SJR2016 =0.319):

Nomination by Elsevier for Journal Excellence Award Romania 2015 (SNIP2014 = 1.029): Elsevier/ Scopus

IJCCC was nominated by Elsevier for Journal Excellence Award - "Scopus Awards Romania 2015" (SNIP2014 = 1.029).

IJCCC is in Top 3 of 157 Romanian journals indexed by Scopus (in all fields) and No.1 in Computer Science field by Elsevier/ Scopus.