• Methods for the bioinformatic identification of bacterial lipoproteins encoded in the genomes of Gram-positive bacteria

      Rahman, O.; Cummings, S.P.; Harrington, D.J.; Sutcliffe, I.C. (2008-11)
      Bacterial lipoproteins are a diverse and functionally important group of proteins that are amenable to bioinformatic analyses because of their unique signal peptide features. Here we have used a dataset of sequences of experimentally verified lipoproteins of Gram-positive bacteria to refine our previously described lipoprotein recognition pattern (G+LPP). Sequenced bacterial genomes can be screened for putative lipoproteins using the G+LPP pattern. The sequences identified can then be validated using online tools for lipoprotein sequence identification. We have used our protein sequence datasets to evaluate six online tools for efficacy of lipoprotein sequence identification. Our analyses demonstrate that LipoP (http://www.cbs.dtu.dk/services/LipoP/) performs best individually but that a consensus approach, incorporating outputs from predictors of general signal peptide properties, is most informative.
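
      The two-stage workflow described in the abstract (a pattern screen followed by consensus validation) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the regular expression is a simplified, hypothetical lipobox-style pattern and is NOT the published G+LPP pattern, and the predictor names, the example sequences, and the `min_votes` threshold are likewise invented for the example.

```python
import re

# HYPOTHETICAL, simplified lipobox-style pattern (NOT the published G+LPP pattern).
# It requires, from the N-terminus:
#   - an initiator Met,
#   - up to 13 residues followed by at least one positively charged residue (K/R),
#   - a stretch of 6-20 residues with no charged residues (D, E, R, K),
#   - a short lipobox-like motif ending in the Cys that would be lipidated.
ILLUSTRATIVE_PATTERN = re.compile(r"^M.{0,13}[RK][^DERK]{6,20}[LVI][ASTG][AG]C")


def screen_proteome(proteins):
    """Return IDs of sequences whose N-terminus matches the illustrative pattern."""
    return [
        seq_id
        for seq_id, seq in proteins.items()
        if ILLUSTRATIVE_PATTERN.match(seq.upper())
    ]


def consensus_call(predictions, min_votes):
    """Accept a candidate when at least `min_votes` predictors call it a lipoprotein.

    `predictions` maps a predictor name to a boolean call. In practice these
    calls would be transcribed from the output of each online tool.
    """
    return sum(1 for call in predictions.values() if call) >= min_votes


# Made-up example sequences, for illustration only.
proteins = {
    "lipo1": "MKKLLIAALLLSALTLAGCSNQ",   # signal-peptide-like N-terminus with lipobox
    "cyto1": "MAEQKDDELRKAVEEAGCT",      # charged residues throughout, no lipobox
    "cyto2": "MSTEDKQEELLNRAGHSD",       # no cysteine at all
}

candidates = screen_proteome(proteins)

# Hypothetical predictor calls for the one surviving candidate.
calls = {"predictor_a": True, "predictor_b": True, "predictor_c": False}
accepted = [c for c in candidates if consensus_call(calls, min_votes=2)]
```

      Keeping the pattern screen separate from the consensus step mirrors the order given in the abstract: the pattern narrows the genome to a shortlist, and the independent predictors are then consulted only for that shortlist.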