MobiDB-lite
MobiDB-lite is an optimized method for highly specific predictions of long intrinsically disordered regions (IDRs). The method uses 8 different predictors to derive a consensus which is filtered for spurious short predictions in a second step. MobiDB-lite can be useful in large-scale annotation scenarios and has indeed already been integrated in the MobiDB, DisProt and InterPro databases. Download HereImplementations
MobiDB
MobiDB (1) was designed to offer a centralized resource for annotations of intrinsic protein disorder. The database features three levels of annotation: manually curated, indirect and predicted. MobiDB.DisProt
DisPot (2) is a community resource annotating protein sequences for intrinsically disorder regions from the literature. DisProt.InterPro
InterPro (3) provides functional analysis of proteins by classifying them into families and predicting domains and important sites. InterPro.Technology
MobiDB-lite is a resource combining 8 predictors in a consensus. IUPred (4) long and short, GlobPlot (5), DisEMBL (6) hot loops and 465 and the 3 flavors of ESpritz (7). The executables of these 8 predictors are launched and their output managed by a Python 2.7.11 wrapper.Dataset
All UniProt sequences with at least one X-ray annotation in MobiDB (Di Domenico et al., 2012) were downloaded on the May 13, 2013 (25 833 entries).Similar chains were removed at 90% pairwise sequence identity using CD-HIT (101 338 chains reduced to 24 669). For more details on the different datasets see (8).Citing MobiDB-lite
Necci M, Piovesan D, Clementel D, Dosztányi Z, Tosatto SCE. MobiDB-lite 3.0: fast consensus annotation of intrinsic disorder flavors in proteins. Bioinformatics. 2020.
Other references
- Piovesan D, Necci M, Escobedo N, Monzon AM, Hatos A, Mičetić I, Quaglia F, Paladin L, Ramasamy P, Dosztányi Z, Vranken WF, Davey NE, Parisi G, Fuxreiter M, Tosatto SCE. MobiDB: intrinsically disordered proteins in 2021 Nucleic Acids Res. 2021 Jan 8;49(D1):D361-D367. doi: 10.1093/nar/gkaa1058. PubMed PMID: 33237329; PubMed Central PMCID: PMC7779018.
- Hatos A, Hajdu-Soltész B, Monzon AM, Palopoli N, Álvarez L, Aykac-Fas B, Bassot C, Benítez GI, Bevilacqua M, Chasapi A, Chemes L, Davey NE, Davidović R, Dunker AK, Elofsson A, Gobeill J, Foutel NSG, Sudha G, Guharoy M, Horvath T, Iglesias V, Kajava AV, Kovacs OP, Lamb J, Lambrughi M, Lazar T, Leclercq JY, Leonardi E, Macedo-Ribeiro S, Macossay-Castillo M, Maiani E, Manso JA, Marino-Buslje C, Martínez-Pérez E, Mészáros B, Mičetić I, Minervini G, Murvai N, Necci M, Ouzounis CA, Pajkos M, Paladin L, Pancsa R, Papaleo E, Parisi G, Pasche E, Barbosa Pereira PJ, Promponas VJ, Pujols J, Quaglia F, Ruch P, Salvatore M, Schad E, Szabo B, Szaniszló T, Tamana S, Tantos A, Veljkovic N, Ventura S, Vranken W, Dosztányi Z, Tompa P, Tosatto SCE, Piovesan D. DisProt: intrinsic protein disorder annotation in 2020. Nucleic Acids Res. 2020 Jan 8;48(D1):D269-D276. doi: 10.1093/nar/gkz975. PubMed PMID: 31713636; PubMed Central PMCID: PMC7145575.
- Blum M, Chang HY, Chuguransky S, Grego T, Kandasaamy S, Mitchell A, Nuka G, Paysan-Lafosse T, Qureshi M, Raj S, Richardson L, Salazar GA, Williams L, Bork P, Bridge A, Gough J, Haft DH, Letunic I, Marchler-Bauer A, Mi H, Natale DA, Necci M, Orengo CA, Pandurangan AP, Rivoire C, Sigrist CJA, Sillitoe I, Thanki N, Thomas PD, Tosatto SCE, Wu CH, Bateman A, Finn RD. The InterPro protein families and domains database: 20 years on. The InterPro protein families and domains database: 20 years on. PubMed PMID: 33156333; PubMed Central PMCID: PMC7778928.
- Dosztányi Z, Csizmok V, Tompa P, Simon I. IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. Bioinformatics. 2005 Aug 15;21(16):3433-4. PubMed PMID: 15955779.
- Linding R, Russell RB, Neduva V, Gibson TJ. GlobPlot: Exploring protein sequences for globularity and disorder. Nucleic Acids Res. 2003 Jul 1;31(13):3701-8. PubMed PMID: 12824398; PubMed Central PMCID: PMC169197.
- Linding R, Jensen LJ, Diella F, Bork P, Gibson TJ, Russell RB. Protein disorder prediction: implications for structural proteomics. Structure. 2003 Nov;11(11):1453-9. PubMed PMID: 14604535.
- Walsh I, Martin AJ, Di Domenico T, Tosatto SC. ESpritz: accurate and fast prediction of protein disorder. Bioinformatics. 2012 Feb 15;28(4):503-9. doi: 10.1093/bioinformatics/btr682. PubMed PMID: 22190692.
- Walsh I, Giollo M, Di Domenico T, Ferrari C, Zimmermann O, Tosatto SC. Comprehensive large-scale assessment of intrinsic protein disorder. Bioinformatics. 2015 Jan 15;31(2):201-8. doi: 10.1093/bioinformatics/btu625. PubMed PMID: 25246432.