Translate Fasta header to UniprotSwissprot ID

get_UniprotID_from_fasta_header(df, idcolumn = "protein_Id")

Arguments

df

data.frame with FASTA headers in first column

idcolumn

Column name containing the IDs

Value

Returns the whole data.frame with a column called UniprotID containing UniprotSwissprot IDs instead of Fasta headers. This format is easier to use for subsequent ID mappings using the Annotationdbi package.

Examples

data("exampleContrastData", package = "prora")
get_UniprotID_from_fasta_header(exampleContrastData)
#> # A tibble: 3,824 x 3
#>    protein_Id            UniprotID estimate
#>    <chr>                 <chr>        <dbl>
#>  1 sp|Q9BRQ0|PYGO2_HUMAN Q9BRQ0      0.514 
#>  2 sp|P13645|K1C10_HUMAN P13645     -2.33  
#>  3 sp|Q9Y4F1|FARP1_HUMAN Q9Y4F1      0.240 
#>  4 sp|P14550|AK1A1_HUMAN P14550     -0.360 
#>  5 sp|P61011|SRP54_HUMAN P61011     -0.0803
#>  6 sp|Q96NC0|ZMAT2_HUMAN Q96NC0     -1.38  
#>  7 sp|Q15417|CNN3_HUMAN  Q15417      0.108 
#>  8 sp|P31350|RIR2_HUMAN  P31350     -0.468 
#>  9 sp|O60664|PLIN3_HUMAN O60664      1.83  
#> 10 sp|O95639|CPSF4_HUMAN O95639      0.199 
#> # ... with 3,814 more rows