bioinformatics_tools3
Differences
This shows you the differences between two versions of the page.
| bioinformatics_tools3 [2021/10/08 17:00] – 134.190.232.9 | bioinformatics_tools3 [2022/03/15 10:28] (current) – 134.190.232.106 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | **Approach Two: submit one loop shell script** | + | **Parsing the InterProScan results** |
| + | InterProScan is a powerful tool for scanning protein sequences against the InterPro member databases. However, | ||
| + | |||
| + | Before: | ||
| < | < | ||
| + | AT1G11750.1 e35d0d54d5d209033484e31afc0ff009 271 Coils Coil Coil 210 230 - T 27-09-2021 - - | ||
| + | AT1G11750.1 e35d0d54d5d209033484e31afc0ff009 271 Hamap MF_00444 ATP-dependent Clp protease proteolytic subunit [clpP]. 84 270 33.141739 T 27-09-2021 IPR001907 ATP-dependent Clp protease proteolytic subunit GO: | ||
| + | AT1G11750.1 e35d0d54d5d209033484e31afc0ff009 271 CDD cd07017 S14_ClpP_2 96 264 3.96818E-88 T 27-09-2021 IPR001907 ATP-dependent Clp protease proteolytic subunit GO: | ||
| + | AT1G11750.1 e35d0d54d5d209033484e31afc0ff009 271 ProSitePatterns PS00382 Endopeptidase Clp histidine active site. 189 202 - T 27-09-2021 IPR033135 ClpP, | ||
| + | AT1G11750.1 e35d0d54d5d209033484e31afc0ff009 271 Pfam PF00574 Clp protease 96 269 3.7E-65 T 27-09-2021 IPR023562 Clp protease proteolytic subunit / | ||
| + | </ | ||
| - | #!/bin/bash | + | After: |
| - | #$ -S /bin/bash | + | < |
| - | . / | + | AT1G11750.1 PF00574 Clp protease |
| - | #$ -cwd | + | AT5G53350.1 PF10431, |
| - | #$ -o logfile | + | AT4G17040.1 PF00574 Clp protease |
| - | #$ -pe threaded 24 | + | AT5G15450.1 PF10431, |
| - | #export PATH=/scratch2/ | + | AT5G51070.1 PF07724, |
| + | </code> | ||
| - | while read line | + | Because we only display the entries that contain " | ||
| - | do | + | |
| - | mafft --auto --thread 24 / | + | * Step One: Run InterProScan on your proteins of interest | ||
| - | /scratch2/ | + | < |
| + | >/interproscan.sh | ||
| + | </code> | ||
| - | FastTree / | + | If you are new to InterProScan, |
| - | done <$1 | + | * Step Two: Run the interproscan_to_one_line.py script, which is available at (https:// | ||
| + | |||
| + | <code> | ||
| + | python3 interproscan_to_one_line.py test_data.tsv out.txt Pfam | ||
| </ | </ | ||
| + | |||
| + | |||
| + | **InterProScan results for human** | ||
| + | |||
| + | Homo_sapiens: | ||
| + | GCF_000001405.39_GRCh38.p13_protein.faa | ||
| + | |||
| + | 1. This protein set includes proteins translated from alternatively spliced transcripts (e.g., | ||
| + | Use this script (isoform2one https:// | ||
| + | |||
| + | 2. Then run the InterProScan analysis. The results are documented here: / | ||
| <Last updated by Xi Zhang on Oct 8th, | <Last updated by Xi Zhang on Oct 8th, | ||
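
The collapsing done in Step Two can be sketched roughly as follows. This is not the interproscan_to_one_line.py script itself, only a minimal illustration of the idea. It assumes the standard InterProScan TSV column order (protein accession in column 1, analysis name in column 4, signature accession in column 5, signature description in column 6). The function names `collapse_interproscan` and `format_one_line` are hypothetical, and the output layout (tab-separated fields, multiple hits joined with commas) is an assumption modeled on the "After" example.

```python
def collapse_interproscan(tsv_lines, analysis="Pfam"):
    """Group hits from one analysis (e.g. Pfam) by protein accession.

    Returns a dict mapping each protein accession to a list of
    (signature accession, signature description) pairs, in the order
    they first appear. Duplicate hits for the same protein are kept once.
    """
    hits = {}
    for line in tsv_lines:
        fields = line.rstrip("\n").split("\t")
        # Skip short or malformed rows and rows from other analyses.
        if len(fields) < 6 or fields[3] != analysis:
            continue
        protein, accession, description = fields[0], fields[4], fields[5]
        entries = hits.setdefault(protein, [])
        if (accession, description) not in entries:
            entries.append((accession, description))
    return hits


def format_one_line(hits):
    """Render one tab-separated line per protein (assumed layout)."""
    lines = []
    for protein, entries in hits.items():
        accessions = ",".join(acc for acc, _ in entries)
        descriptions = ",".join(desc for _, desc in entries)
        lines.append("\t".join([protein, accessions, descriptions]))
    return lines
```

With a file handle opened on the InterProScan TSV output, `format_one_line(collapse_interproscan(handle, "Pfam"))` would give one line per protein that has at least one Pfam hit. Proteins without a hit in the chosen analysis are omitted.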
bioinformatics_tools3.1633723230.txt.gz · Last modified: by 134.190.232.9
