<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">
<HTML><HEAD><TITLE>GeneSeqer Output</TITLE></HEAD><BODY bgcolor=white text=black link=blue><FONT FACE="Courier"><PRE>
<A NAME="TOP"></A>
<center><h3><a href="http://www.plantgdb.org/~volker/PLC/NEW/Summary.html" target="_blank">Click here to access the result summary and navigation page</a></h3></center><br>GeneSeqer. Version of May 5, 2004.
Date run: Sun Aug 8 22:36:43 2004
(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQualityHSP 12, MinQualityCHAIN 30.
GenBank file: AC137075
________________________________________________________________________________
<A NAME="AC137075-45000-50999"></A>
Sequence 1: <A HREF="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Search&db=Nucleotide&term=AC137075&doptcmdl=GenBank" TARGET="NUCLEOTIDE SEQUENCE SEARCH">AC137075</A>, from 45001 to 51000.
... started at: Sun Aug 8 22:36:43 2004
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST library file: LIBestA; matching gDNA +strand ...
... found all matches, elapsed seconds = 0
... matches indexed, elapsed seconds = 0 HitsTableSize = 8
EST library file: LIBestA; matching gDNA -strand ...
... found all matches, elapsed seconds = 0
... matches indexed, elapsed seconds = 0 HitsTableSize = 2
<A NAME="PGS8@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 8 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=32980476" TARGET="NUCLEOTIDE SEQUENCE SEARCH">32980476+</A>)
1 GGGAATTAAT AAAATGTAGT GACCGTGCCG ACGGAGGAAG ACGAAGACAG GAGACGGTTA
61 CTGGAGTTGA GTACTCCACC CACCACACCA CACAAGAGAG AGAAGGGTTG GGGTTGGGGT
121 CACTCTTTCC TTTTCTCTGT TGTTGTTGGC TTGATGATGA GCTGATTCGC ATAATTCCAA
181 TTCCAATTCC AATTATAATT CCAAGAGGCT TATCATCAAC TCAACTGAAG AAGAAGAAGA
241 AGAAGAAGAA GAAGAAGAGG AGGAGGGTTG TGAACTGATT TGGGAAGGGA GGGGGAGGAG
301 GAAGGCGATC GATCGGCGAT GGGCACGTAC AAGTGCTGCC TCATCTTCAA GCGCCGCTTC
361 CGCTGGAACG ACGCGCCGCC GCCCGACGAT GTCCGCGCCC TCTTCGCCAA CCACTCCGCC
421 GGCGGTGGCC CCCACATGGC CGCCGACGGC CTCCGCGCCT ACCTCCAGGC CACCGGCCAG
481 GACGGCGACG TGGACATGGA GCGGCTGGTG GAGCAGATCC GGCAGCTGCA GGGGCGCGGC
541 GGGCGCATCC CGCGGGTGGG GCGGGCACTC CCACTCCTGA CGGTGGACGA CTTCCACCGA
601 TTCCTCTTCT CCCACGAGCT GAACCCACCC ATCCGGCACG GGCAGGGGCA GGTGCACCAC
661 GACATGGCCG CCCCGCTCTC CCACTACTTC ATCTACACCG GCCACAACTC CTACCTCACC
721 GGCAACCAGC TCAGCAGCGA CTGCAGCGAC CTCCCCATCA TCAGGGCTCT CCAGAGGGGC
781 GTCCGCGTCA TCGAGCTCGA CATGTGGCCC AACTCCTCCA AGGATGACAT CAGCATCCTC
841 CATGGCAGGA CGCTCACCAC CCCGGTCTCC CTCCTCAAAT GCCTCCTCTC CATCAAGCAA
901 CACGCCTTTG AGGCCTCCCC TTACCCGGTT ATCATCACGC TCGAAGACCA CCTCACCCCC
961 GATCTCCAGG ACAAAGCAGC CAAGATGGTT CTTGAAGTTT TCGGCGACAT CCTCTACTAC
1021 CCTGACAAAG ATCACCTCAA AGAGTTCCCT TCGCCTCAAG ACCTCAAGGG CCGTGTCCTC
1081 CTCTCCACCA AGCCCCCCAG GGAGTACCTT CAAGCCAAGG ATGGTAATGC TGCCACCATC
1141 AAAGAGGACG CCAAGGCCGC CGCCACTGAC GATGCCGCAT GGGGAAAAGA AGTCCCAGAT
1201 ATTCACTCTC AAATCCACTC TGCCACTAAA CATGACCAAA GAGAAGATGA CGACGACACC
1261 GATGAAGACG AAGATGACGA GGAGGAGGAG CAGAAAATGC AACAGCATCT AGCTCCACAG
1321 TACAAACACC TTATTACCAT CAAAGCAGGA AAGCCAAAAG GTACTCTACT TGATGCCTTA
1381 CAGAGTGACC CAGAAAAGGT TAGAAGGCTC AGTTTGAGCG AGCAACAACT TGCCAAATTG
1441 GCAGATCATC ATGGTACCGA AATTGTAAGG TTCACACAGA GAAACCTACT GAGGATATAC
1501 CCAAAGGGCA CTCGGGTCAC ATCATCCAAC TATAATCCAT TTCTTGGTTG GGTGCATGGT
1561 GCTCAGATGG TAGCGTTCAA TATGCAGGGA TATGGAAGAG CTCTTTGGTT GATGCATGGA
1621 TTTTATAAAG CTAATGGTGG CTGTGGTTAT GTGAAGAAAC CAGATTTCTT AATGCAAACT
1681 GATCCAGAGG TTTTTGACCC AAAAAAATCC CTATCTCCCA AGAAAACCTT GAAGGTGAAA
1741 GTATACATGG GGGATGGTTG GCGGATGGAC TTCACGCAGA CCCACTTTGA TCAATATTCT
1801 CCTCCAGACT TTTATGCACG GGTGGGGATA GCGGGAGTAC CAGCGGACTC GGTGATGAAG
1861 AGAACGAGGG CGATAGAGGA TAACTGGGTG CCGGTGTGGG AGGAGGATTT CACCTTCAAA
1921 CTGACCGTGC CGGAGATCGC GTTGCTGCGG GTGGAGGTGC ACGAGTACGA CATGTCGGAG
1981 AAGGACGACT TCGGCGGCCA GACGGTGCTG CCGGTGTCGG ATCTCATCCC GGGGATCCGA
2041 GCGGTGGCAC TCCACGACCG CAAAGGGATC AAGTTGAACA ACGTCAAGCT TCTCATGCGC
2101 TTCGAGTTTG AATGACCCAA CACACCGACA CTTTCTTTCT TTCTCGCCGC ATCGCATTGC
2161 ACTGTGCCTG TGCTTGTGCA GCATCCATCA TTTGGTTTGG TTTTTCATGT TCCTGTGCAT
2221 ACGCATTTGT GTCTGTACAT AGGCTCGGTC CTGTATATTG TTTGTGAGTA ACATGTAATA
2281 ATAAGGCTTC ACGCCATGTT CATTCCG
Predicted gene structure (within gDNA segment 45001 to 51000):
Exon 1 45576 45626 ( 51 n); cDNA 267 315 ( 49 n); score: 0.569
Intron 1 45627 46117 ( 491 n); Pd: 0.892 (s: 0.56), Pa: 0.000 (s: 0.80)
Exon 2 46118 46439 ( 322 n); cDNA 316 648 ( 333 n); score: 0.612
Intron 2 46440 47301 ( 862 n); Pd: 0.852 (s: 0.68), Pa: 0.000 (s: 0.78)
Exon 3 47302 47501 ( 200 n); cDNA 649 848 ( 200 n); score: 0.750
Intron 3 47502 47600 ( 99 n); Pd: 0.981 (s: 0.74), Pa: 0.943 (s: 0.68)
Exon 4 47601 47736 ( 136 n); cDNA 849 984 ( 136 n); score: 0.721
Intron 4 47737 47813 ( 77 n); Pd: 0.967 (s: 0.74), Pa: 0.462 (s: 0.72)
Exon 5 47814 48074 ( 261 n); cDNA 985 1238 ( 254 n); score: 0.648
Intron 5 48075 48150 ( 76 n); Pd: 0.950 (s: 0.62), Pa: 0.864 (s: 0.53)
Exon 6 48151 48383 ( 233 n); cDNA 1239 1469 ( 231 n); score: 0.730
Intron 6 48384 48489 ( 106 n); Pd: 0.981 (s: 0.74), Pa: 0.477 (s: 0.82)
Exon 7 48490 48607 ( 118 n); cDNA 1470 1587 ( 118 n); score: 0.839
Intron 7 48608 48678 ( 71 n); Pd: 0.976 (s: 0.88), Pa: 0.976 (s: 0.80)
Exon 8 48679 48831 ( 153 n); cDNA 1588 1734 ( 147 n); score: 0.765
Intron 8 48832 48920 ( 89 n); Pd: 0.988 (s: 0.64), Pa: 0.739 (s: 0.82)
Exon 9 48921 49007 ( 87 n); cDNA 1735 1821 ( 87 n); score: 0.782
Intron 9 49008 49201 ( 194 n); Pd: 0.601 (s: 0.74), Pa: 0.941 (s: 0.76)
Exon 10 49202 49500 ( 299 n); cDNA 1822 2124 ( 303 n); score: 0.763
MATCH AC137075+ 32980476+ 0.712 1860 0.806 C
PGS_AC137075+_32980476+ (45576 45626,46118 46439,47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49500)
Alignment (genomic DNA sequence = upper lines):
GTAGTTAACA GATGGTTGCA TGGAGCATCT GATCCATCAG GCGATCCATC CGTAGCTTTC 45635
|| || ||| ||| | | |||| || || |||||| |||
GTTGTGAACT GATTTGGGAA GGGAG--GGG GAGGAGGAAG GCGATCGATC G......... 315
TCTTAAAATG TTCCCTTATT TGCTTCAAAA TTCAATATCA TCTTTAAAAA TTGACAATAT 45695
.......... .......... .......... .......... .......... .......... 315
TAATAAATTA ATTATGTAAA ATTTTAATTC CAAATTCAAT TAATAGATAA AGAATTAATT 45755
.......... .......... .......... .......... .......... .......... 315
AAACTCAAAG TGGTAGAAGA ACTATTTAGT CAGATTATTT TAACTTGTAT GGGTTGAATA 45815
.......... .......... .......... .......... .......... .......... 315
TGAGCTTGAA TATTGGTGGA GTGATACATC GCTAAATTAC CTACTTTCAC AATTTTGAAG 45875
.......... .......... .......... .......... .......... .......... 315
TATGTTATAA ACTACTCTCA CAATTTTTAT AGTACATGCA TTTTTCCTTA AAGCAAAATG 45935
.......... .......... .......... .......... .......... .......... 315
ACTTCCCTAA CCATCCTCGG CACCTCCAAA TCGCCCTAAA AACTTGGAAT GCATGCGCGG 45995
.......... .......... .......... .......... .......... .......... 315
TGCGCGTTCC GCCTCACACG CCTCCGTCGG GAGCCAGACG CCGAGCCGGG ACGAGCGGGC 46055
.......... .......... .......... .......... .......... .......... 315
AAACAGGGAA AAAGCGTGTG CGCGGCGAAG GTCCATGAAG GCGGCGCCGG CGCCGGCGAC 46115
.......... .......... .......... .......... .......... .......... 315
GCGCAAATGG GGACGTACAA GTGCTGCATC TTCTTCACCC GCAGGTTCGC GCT-GAGCGA 46174
|| |||| | |||||||| ||||||| || |||||| | || | ||| | ||| || |||
..GC-GATGG GCACGTACAA GTGCTGCCTC ATCTTCAAGC GCCGCTTC-C GCTGGAACGA 371
CGCGTCCACG CCGGGCGACG TGCGCATGCT GTTCACCCGC CACGCCGGCG GC-G-CG-CC 46231
|||| | || || | ||| | | ||| || ||| || | ||| ||| || || | | ||
CGCGCCGCCG CCCGACGATG TCCGCGCCCT CTTCGCCAAC CACTCCGCCG GCGGTGGCCC 431
CTACATGGGC ATCGACGAGC TCCGGCGCTA CCTCGCCGCC AGCGGGGAGG CCCACGTCGA 46291
| |||||| | ||||| | |||| ||| |||| ||| | ||| ||| ||| |||
CCACATGGCC GCCGACGGCC TCCGCGCCTA CCTCCAGGCC ACCGGCCAGG ---ACGGCGA 488
CGCCGACACG GCGGAGC-GG ATCATC-GA- CCGGGTCCTG CAGGAGCGCA GC---CGCAC 46345
|| |||| | | | || || | | || |||| ||| |||| |||| || ||||
CGTGGACATG GAGCGGCTGG TGGAGCAGAT CCGGCAGCTG CAGGGGCGCG GCGGGCGCAT 548
CCCGC-GCT- --TCGGG-AA GCCGTC-GCT CACCATCGAC GATTTCCAGT ACTTCCTCTT 46399
||||| | | |||| | || | || || | ||| || ||||| ||||||||
CCCGCGGGTG GGGCGGGCAC TCCCACTCCT GACGGTGGAC GACTTCCACC GATTCCTCTT 608
CTCCGAGGAC CTCAACCCGC CCATCTGCCA TTCCAAGGAA GTAAGCAAAC TACCCGCTCG 46459
|||| | || || ||||| | ||||| | || |||
CTCCCACGAG CTGAACCCAC CCATCCGGCA CGGGCAGGGG .......... .......... 648
ATCCCCAATT TCCCAAATGC TGTTAGATTC ATCGTCATTC CGTGATAATC CTGCCGTTGC 46519
.......... .......... .......... .......... .......... .......... 648
ACAATGCGGT GAAATGGCGT AATTTGCTAG GATTCAGAAG GGGATTCTTG GGGTTTGTTT 46579
.......... .......... .......... .......... .......... .......... 648
AGTTCACATT AAAATTAAAA GTTTGGTTAA AATTGGAATG ATGTGACGAA AAGTTAGAAG 46639
.......... .......... .......... .......... .......... .......... 648
TTTGTGTGTG CAGGAAAGTT TTGATGCGAT GGAAAAGTTG GAAGTTTGAA GAAAAAAATT 46699
.......... .......... .......... .......... .......... .......... 648
AAAACTAAAC ATGGCTTTGG TCGGAACTGC TCTGTAGTGT GGACGTCATT CAAATCTTTA 46759
.......... .......... .......... .......... .......... .......... 648
TGAAGTATTT TTTTAAAGAT GGATCACACA TGTGATTAAC ATAGTTATAT AAAATTTTGT 46819
.......... .......... .......... .......... .......... .......... 648
TAAAATTTGA AAATGTAGAA TACGATGATA TAAATCACTA TATAAACATG CAAGTTTAAA 46879
.......... .......... .......... .......... .......... .......... 648
TTTGATCCAC GCAAAGAGAA AAAATATAAC CGATTATGTT TGAGTTGTGG CATTACTATT 46939
.......... .......... .......... .......... .......... .......... 648
TTCTATCTGG TTCTATTAAT TTTTTTTCTC CAATTGTAGA TCGAATCAAG CCTTTGTATG 46999
.......... .......... .......... .......... .......... .......... 648
TTTGTACATA GACTTATGCT ATCGTAATCT ACTCCCATTT TTTTTGGACG GAGGGAGTAT 47059
.......... .......... .......... .......... .......... .......... 648
GTTATCAATT TTAGTTTAAT TTTTTTTACA ACTATTTGGG TCACATACAA ATAACTGGCA 47119
.......... .......... .......... .......... .......... .......... 648
CATATGCACC TAGGTGTAAA GAAGTCAACA TGCAGGTAAT GAATTGAATT TCCATACAAC 47179
.......... .......... .......... .......... .......... .......... 648
ATTCTGCTCT CCTAAGAAAT TACGCTTACA AGTTCACTTG GATATTGCTA AACTCCATTT 47239
.......... .......... .......... .......... .......... .......... 648
TGATATTACT TAGTGTGTAC TGAATGATCT AAGATGTGAG TTGATGGTAG ATCTCGTGCT 47299
.......... .......... .......... .......... .......... .......... 648
CTCAGGTCCA TCACGACATG AATGCACCAT TATCGCACTA CTTCATATAC ACTGGACACA 47359
||||| || ||||||||| || || | || ||||| |||||| ||| || || ||||
..CAGGTGCA CCACGACATG GCCGCCCCGC TCTCCCACTA CTTCATCTAC ACCGGCCACA 706
ACTCGTATCT GACGGGCAAT CAACTTAGCA GTGACTGCAG TGATATTCCC ATCATTAAGG 47419
|||| || || || ||||| || || |||| | |||||||| || | ||| ||||| | ||
ACTCCTACCT CACCGGCAAC CAGCTCAGCA GCGACTGCAG CGACCTCCCC ATCATCAGGG 766
CACTGCAAAT AGGCGTCCGT GTAATTGAAC TGGACATGTG GCCAAATTCT TCTAAAGATG 47479
| || || | |||||||| || || || | | |||||||| ||| || || || || ||||
CTCTCCAGAG GGGCGTCCGC GTCATCGAGC TCGACATGTG GCCCAACTCC TCCAAGGATG 826
ATGTTGATAT TCTCCATGGA AGGTATGCAT GAGAATTGCT CACTTGAAGA CATTTTTGTT 47539
| | || |||||||| ||
ACATCAGCAT CCTCCATGGC AG........ .......... .......... .......... 848
CTGCACTGGA GGCCATTCGA TATGCTATGA CCTTATTCCA AACTATTTGC TTCTTTGGTA 47599
.......... .......... .......... .......... .......... .......... 848
GGACACTGAC TGCCCCAGTA TCACTTATCA AATGCTTGAA ATCCATCAAA GAATATGCCT 47659
||| || || |||| || || || ||| ||||| | |||||||| || | ||||
.GACGCTCAC CACCCCGGTC TCCCTCCTCA AATGCCTCCT CTCCATCAAG CAACACGCCT 907
TTGTTGCGTC TCCCTACCCT GTTATTATAA CATTAGAAGA CCACCTTACA TCTGATCTTC 47719
||| || || || ||||| ||||| || | | | ||||| |||||| || | ||||| |
TTGAGGCCTC CCCTTACCCG GTTATCATCA CGCTCGAAGA CCACCTCACC CCCGATCTCC 967
AGGCGAAAGT AGCTAAGGTA ATTGCATTTT CCTCGTATGA TCAATAATTT GGTGCAGTTG 47779
||| |||| ||| |||
AGGACAAAGC AGCCAAG... .......... .......... .......... .......... 984
ATTCTGTTGT AGCTAGTTAT GAAATTTTCT TTAGATGGTT CTTGAAGTAT TTGGAGATAC 47839
|||||| |||||||| | | || || |
.......... .......... .......... ....ATGGTT CTTGAAGTTT TCGGCGACAT 1010
CCTATATTAT CCCGAGTCAA AACATCTTCA AGAATTTCCT TCACCCGAAG CACTGAGGGG 47899
||| || || || || | | || || | ||| || ||| || || ||| || | |||
CCTCTACTAC CCTGACAAAG ATCACCTCAA AGAGTTCCCT TCGCCTCAAG ACCTCAAGGG 1070
ACGTGTCATC CTCTCAACAA AACCCCCAAA GGAGTACCTT GAATCAAAAG GTGGTACTAT 47959
|||||| || ||||| || | | ||||| | |||||||||| || | || | ||||| ||
CCGTGTCCTC CTCTCCACCA AGCCCCCCAG GGAGTACCTT CAAGCCAAGG ATGGTA--AT 1128
GAAAGACAGA GACATTGAGC CTCAGTTTAG CAAAGGACAA AATGAAGAAG CTGTCTGGGG 48019
| | || || ||| | | || | | ||| || | | | |||||
G-CTGCCACC ATCAAAGAG- GAC-GCCAAG -GCCGCCGCC ACTGACGATG CCGCATGGGG 1184
AACAGAAGTC CCAGATATTC AGGATGAGAT GCAAACCGCC GACAAGGTTC TACTGGTTTT 48079
|| ||||||| |||||||||| | | | || || | ||| || | ||
AAAAGAAGTC CCAGATATTC ACTCTCAAAT CCACTCTGCC -ACTAAACAT GACCA..... 1238
AACATTTGTT GTTTCTTGTT TCTTAGCATA TGGTGTATGT CCATCACTGT TGTATTGGCT 48139
.......... .......... .......... .......... .......... .......... 1238
TTATTCCCTA GCAGCATGAG AATGATATAC TATACACCCA AAGA-GATGT GGAAGAAGAT 48198
|| | || |||| || | || |||| || | || || ||
.......... .AAG-A-GAA GATGA-CGAC GACACCGATG AAGACGAAGA TGACGAGGAG 1284
GATGAGAAGA AAATGTGCCA GCATCACCCA CTAGAGTATA AACACCTTAT TACTATTAAG 48258
|| ||| ||| ||||| || ||||| | | | |||| | |||||||||| ||| || ||
GAGGAGCAGA AAATGCAACA GCATCTAGCT CCACAGTACA AACACCTTAT TACCATCAAA 1344
GCAGGAAAGC CAAAGGGTGC TGTAGTTGAT GCCTTAAAGG GTGATCCAGA TAAAGTTAGA 48318
|||||||||| |||| ||| | | || ||||| |||||| || |||| ||||| || ||||||
GCAGGAAAGC CAAAAGGTAC TCTACTTGAT GCCTTACAGA GTGACCCAGA AAAGGTTAGA 1404
CGCCTCAGTT TGAGTGAGCA GGAACTTGCA AAAGTGGCAG CGCATCATGG TCGTAACATC 48378
| ||||||| |||| ||||| ||||||| ||| |||||| |||||||| | | ||
AGGCTCAGTT TGAGCGAGCA ACAACTTGCC AAATTGGCAG ATCATCATGG TACCGAAATT 1464
GTGAGGTTCG TTTAGCAAAT ATACTGAATT TCGTAGCAAA GTATTTTCTA TCATTGCACC 48438
|| ||
GTAAG..... .......... .......... .......... .......... .......... 1469
AGAGCTCTCT ATGTCCATTG ACCTTAACTT CATTCTGTTT ATTCAAAGCA GCTTTACACA 48498
|| |||||
.......... .......... .......... .......... .......... .GTTCACACA 1478
TAAAAATCTT CTGAGAATAT ACCCAAAGGG CACTCGCTTC AATTCTTCGA ACTATAATCC 48558
| ||| || ||||| |||| |||||||||| |||||| || | || || | ||||||||||
GAGAAACCTA CTGAGGATAT ACCCAAAGGG CACTCGGGTC ACATCATCCA ACTATAATCC 1538
GTTTCTTGGT TGGGTGCATG GTGCACAAAT GGTGGCATTT AATATGCAGG TACATTTCTA 48618
||||||||| |||||||||| |||| || || ||| || || |||||||||
ATTTCTTGGT TGGGTGCATG GTGCTCAGAT GGTAGCGTTC AATATGCAG. .......... 1587
ACATGACACT CCTCTGCTAC ATCATATTGG CCTGAATGCC TGATACATTT TTCTTCGCAG 48678
.......... .......... .......... .......... .......... .......... 1587
GGGTATGGAA GATCTCTTTG GCTAATGCAC GGATTCTACA AGGCCAACGG TGGCTGCGGT 48738
|| ||||||| || ||||||| | | ||||| ||||| || | | || || || |||||| |||
GGATATGGAA GAGCTCTTTG GTTGATGCAT GGATTTTATA AAGCTAATGG TGGCTGTGGT 1647
TATGTGAAGA AGCCAGATTT CATGATGCAA ACTTGTCCAG ATGGAAATGT TTTTGACCCG 48798
|||||||||| | |||||||| | | |||||| ||| ||||| | | || |||||||||
TATGTGAAGA AACCAGATTT CTTAATGCAA ACTGATCCAG A--G----GT TTTTGACCCA 1701
AAAGCAGATT TACCTGTGAA GAAAACACTC AAGGTAGGTT TGTGGCATAT GTTTCTTCCT 48858
||| | || || || |||||| | |||
AAAAAATCCC TATCTCCCAA GAAAACCTTG AAG....... .......... .......... 1734
TTCATTTTCA TCTCTGAAAT TCAGGAATCG AGCTACTTAC AGCTTGCCTG TTTGTCTACC 48918
.......... .......... .......... .......... .......... .......... 1734
AGGTCAAAGT ATACATGGGC GAAGGTTGGC AGAGCGACTT CAAGCAGACA TACTTCGACA 48978
|| ||||| ||||||||| || ||||||| || ||||| || |||||| |||| ||
..GTGAAAGT ATACATGGGG GATGGTTGGC GGATGGACTT CACGCAGACC CACTTTGATC 1792
CGTATTCCCC TCCAGACTTC TACGCAAAGG TACATCGAAT TTTACGCTGA TGCCAAACGC 49038
||||| || ||||||||| || ||| |
AATATTCTCC TCCAGACTTT TATGCACGG. .......... .......... .......... 1821
CAACAAATTT GCAAATGCAA AACGGAGCTT TGAAAAAACA TGTATATATG TATAACTTTT 49098
.......... .......... .......... .......... .......... .......... 1821
ACATATGGAG TGAGATGAAG ACAAACTTTA TATCAAAATT GTAGAGCTCC ATGAGTTCTA 49158
.......... .......... .......... .......... .......... .......... 1821
CGACGTTCTT ATTGACTAGT CCATCGTTCC ATCATCATAA CAGGTGGGCA TTGCCGGGGT 49218
||||| | | || || ||
.......... .......... .......... .......... ...GTGGGGA TAGCGGGAGT 1838
TCCGTCGGAC TCGGTGATGC AGAAGACGAA AGCCGTGGAG GACAGCTGGG TTCCCGTGTG 49278
|| ||||| ||||||||| ||| |||| || | ||| || | ||||| | || |||||
ACCAGCGGAC TCGGTGATGA AGAGAACGAG GGCGATAGAG GATAACTGGG TGCCGGTGTG 1898
GGAGGAGGAG TTCGTGTTCC CGCTGACCGT CCCGGAGATC GCGCTGCTCC GCGTGGAGGT 49338
||||||||| ||| ||| |||||||| ||||||||| ||| |||| | | ||||||||
GGAGGAGGAT TTCACCTTCA AACTGACCGT GCCGGAGATC GCGTTGCTGC GGGTGGAGGT 1958
GCACGAGTAC GAC--GTGAG CG-AGGACGA CTTCGGCGGG CAGACGGCGC TCCCGGTGTC 49395
|||||||||| ||| || | | ||||||| ||||||||| ||||||| || | ||||||||
GCACGAGTAC GACATGTCGG AGAAGGACGA CTTCGGCGGC CAGACGGTGC TGCCGGTGTC 2018
GGAGCTGCGG CCGGGGATCC GCACCGTGCC GCTCTTCGAC CACAAGGGGC TCAAGTTCAA 49455
||| || |||||||||| | | ||| | ||| |||| | ||| ||| ||||||| ||
GGATCTCATC CCGGGGATCC GAGCGGTGGC ACTCCACGAC CGCAAAGGGA TCAAGTTGAA 2078
GAGCGTCAAG CTCCTCATGC GGTTCGAGTT CGTCT-AGCA AATTCA 49500
| ||||||| || ||||||| | |||||||| | | | | || ||
CAACGTCAAG CTTCTCATGC GCTTCGAGTT TGAATGACCC AACACA 2124
hqPGS_AC137075+_32980476+ (47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49500)
<A NAME="PGS7@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 7 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=12698877" TARGET="NUCLEOTIDE SEQUENCE SEARCH">12698877+</A>)
1 CGGCACGAGG TCCAATTCCA ATTATAATTC CAAGAGGCTC ATCATCAACT CAACTGAAGA
61 AGAAGAAGAA GAAGAAGAAG AAGAAGAGGA GGAGGGTTCG GCACGAGGTG GGAAGGGAGG
121 GGGAGGAGGA AGGCGATCGA TCGGCGATGG GCACGTACAA GTGCTGCCTC ATCTTCAAGC
181 GCCGCTTCCG CTGGAACGAC GCGCCGCCGC CCGACGATGT CCGCGCCCTC TTCGCCAACC
241 ACTCCGCCGG CGGTGGCCCC CACATGGCCG CCGACGGCCT CCGCGCCTAC CTCCAGGCCA
301 CCGGCACGAG GACGGCGACG TGGACATGGA GCGGCTGGTG GAGCAGATCC GGCAGCTGCA
361 GGGGCGCGGC GGGCGCATCC CGCGGGTGGG GCGGGCACTC CCACTCCTGC ACGGTGGACG
421 ACTTCCACCG ATTCCTCTTC TCCCACGAGC TGAACCCACC CATCCGGCAC GGGCAGGGGC
481 AGGTGCACCA CGACATGGCC GCCCCGCTCT CCCACTACTT CATCTACACC GGCCACAACT
541 CCTACCTCAC CGGCAACCAG CTCAGCAGCG ACTGCAGCGA CCTCCCCATC ATCAGGGCTC
601 TCCAGAGGGG CGTCCGCGTC ATCGAGCTCG ACATGTGGCC CAACTCCTCC AAGGATGACA
661 TCAGCATCCT CCATGGCAGG ACGCCCACCA CCCCGGTCTC CCTCCTCAAA TGCCTCCTCT
721 CCATCAAGGA ACACGCCTTT GAGGCCTCCC CTTACCCGGT TATCATCACG CTCGAAGACC
781 ACCTCACCCC CGATCTCCAG GACAAAGCAG CCAAGATGGT TCTTGAAGTT TTCGGCGACA
841 TCCTCTACTA CCCTGACAAA GATCACCTCA AAGAGTTCCC TTCGCCTCAA GACCTCAAGG
901 GCCGTGTCCT CCTCTCCACC AAGCCCCCCA AGGAGTACCT TCAAGCCAAG GATGGTAATG
961 CTGCCACCAT CAAAGAGGAC GCCAAGGCCG CCGCCACTGA CGATGCCGCA TGGGGAAAAG
1021 AAGTCCCAGA TATTCACTCT CAAATCCACT CTGCCACTAA ACATGACCAA AGAGAAGATG
1081 ACGACGACAC CGATGAAGAC GAAGATGACG AGGAGGAGGA GCAGAAAATG CAACAGCATC
1141 TAGCTCCACA GTACAAACAC CTTATTACCA TCAAAGCAGG AAAGCCAAAA GGTACTCTAC
1201 TTGATGCCTT ACAGAGTGAC CCAGAAAAGG TTAGAAGGCT CAGTTTGAGC GAGCAACAAC
1261 TTGCCAAATT GGCAGATCAT CATGGTACCG AAATTGTCAG GTTCACACAG AGAAACCTAC
1321 TGAGGATATA CCCAAAGGGC ACTCGGGTCA CATCATCCAA CTATAATCCA TTTCTTGGTT
1381 GGGTGCATGG TGCTCAGATG GTAGCGTTCA ATATGCAGGG ATATGGAAGA GCTCTTTGGT
1441 TGATGCATGG ATTTTATAAA GCTAATGGTG GCTGTGGTTA TGTGAAGAAA CCAGATTTCT
1501 TAATGCAAAC TGATCCAGAG GTTTTTGACC CAAAAAAATC CCTATCTTCC AAGAAAACCT
1561 TGAAGGTGAA AGTATACATG GGGGATGGTT GGCGGATGGA CTTCACGCAG ACCCACTTTG
1621 ATCAATATTC TCCTCCAGAC TTTTATGCAC GGGTGGGGAT AGCGGGAGTA CCAGCGGACT
1681 CGGTGATGAA GAGAACGAGG GCGATAGAGG ATAACTGGGT GCCGGTGTGG GAGGAGGATT
1741 TCACCTTCAA ACTGACCGTG CCGGAGATCG CGTTGCTGCG GGTGGAGGTG CACGAGTACG
1801 ACATGTCGGA GAAGGACGAC TTCGGCGGCC AGACGGTGCT GCCGGTGTCG GAGCTCATCC
1861 CGGGGATCCG AGCGGTGGCA CTCCACGACC GCAAAGGGAT CAAGTTGAAC AACGTCAAGC
1921 TTCTCATGCG CTTCGAGTTT GAATGACCCA ACACACCGAC ACTTTCTTTC TCGCCGCATT
1981 GCATTGCACT GTGCCTGTGC CTGTGCAGCA TCCATCATTT GGTTTGGTTT TTCATGTTCC
2041 TGTGCATACG CATTTGTGTC TGTACATAGG CTCGGTCCTG TATATTGTTT GTGAGTAACA
2101 TGTAACAATA AGGCTTCACG CCATGTTCAT TCCTCCTGTT TGAACCCCTC CAAATATATT
2161 AATACCGAAG GTACAAAATA TCTCAACGGC AAAAAAAAAA AAAAAAA
Predicted gene structure (within gDNA segment 45068 to 51000):
Exon 1 45614 45626 ( 13 n); cDNA 131 143 ( 13 n); score: 0.846
Intron 1 45627 46117 ( 491 n); Pd: 0.892 (s: 0), Pa: 0.000 (s: 0.80)
Exon 2 46118 46439 ( 322 n); cDNA 144 479 ( 336 n); score: 0.606
Intron 2 46440 47301 ( 862 n); Pd: 0.852 (s: 0.68), Pa: 0.000 (s: 0.78)
Exon 3 47302 47501 ( 200 n); cDNA 480 679 ( 200 n); score: 0.750
Intron 3 47502 47600 ( 99 n); Pd: 0.981 (s: 0.74), Pa: 0.943 (s: 0.68)
Exon 4 47601 47736 ( 136 n); cDNA 680 815 ( 136 n); score: 0.721
Intron 4 47737 47813 ( 77 n); Pd: 0.967 (s: 0.74), Pa: 0.462 (s: 0.72)
Exon 5 47814 48074 ( 261 n); cDNA 816 1069 ( 254 n); score: 0.651
Intron 5 48075 48150 ( 76 n); Pd: 0.950 (s: 0.62), Pa: 0.864 (s: 0.53)
Exon 6 48151 48383 ( 233 n); cDNA 1070 1300 ( 231 n); score: 0.730
Intron 6 48384 48489 ( 106 n); Pd: 0.981 (s: 0.74), Pa: 0.477 (s: 0.82)
Exon 7 48490 48607 ( 118 n); cDNA 1301 1418 ( 118 n); score: 0.839
Intron 7 48608 48678 ( 71 n); Pd: 0.976 (s: 0.88), Pa: 0.976 (s: 0.80)
Exon 8 48679 48831 ( 153 n); cDNA 1419 1565 ( 147 n); score: 0.765
Intron 8 48832 48920 ( 89 n); Pd: 0.988 (s: 0.64), Pa: 0.739 (s: 0.82)
Exon 9 48921 49007 ( 87 n); cDNA 1566 1652 ( 87 n); score: 0.782
Intron 9 49008 49201 ( 194 n); Pd: 0.601 (s: 0.74), Pa: 0.941 (s: 0.76)
Exon 10 49202 49500 ( 299 n); cDNA 1653 1955 ( 303 n); score: 0.766
PPA cDNA 2191 2207
MATCH AC137075+ 12698877+ 0.716 1822 0.826 C
PGS_AC137075+_12698877+ (45614 45626,46118 46439,47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49500)
Alignment (genomic DNA sequence = upper lines):
AGGCGATCCA TCCGTAGCTT TCTCTTAAAA TGTTCCCTTA TTTGCTTCAA AATTCAATAT 45673
|||||||| | ||
AGGCGATCGA TCG....... .......... .......... .......... .......... 143
CATCTTTAAA AATTGACAAT ATTAATAAAT TAATTATGTA AAATTTTAAT TCCAAATTCA 45733
.......... .......... .......... .......... .......... .......... 143
ATTAATAGAT AAAGAATTAA TTAAACTCAA AGTGGTAGAA GAACTATTTA GTCAGATTAT 45793
.......... .......... .......... .......... .......... .......... 143
TTTAACTTGT ATGGGTTGAA TATGAGCTTG AATATTGGTG GAGTGATACA TCGCTAAATT 45853
.......... .......... .......... .......... .......... .......... 143
ACCTACTTTC ACAATTTTGA AGTATGTTAT AAACTACTCT CACAATTTTT ATAGTACATG 45913
.......... .......... .......... .......... .......... .......... 143
CATTTTTCCT TAAAGCAAAA TGACTTCCCT AACCATCCTC GGCACCTCCA AATCGCCCTA 45973
.......... .......... .......... .......... .......... .......... 143
AAAACTTGGA ATGCATGCGC GGTGCGCGTT CCGCCTCACA CGCCTCCGTC GGGAGCCAGA 46033
.......... .......... .......... .......... .......... .......... 143
CGCCGAGCCG GGACGAGCGG GCAAACAGGG AAAAAGCGTG TGCGCGGCGA AGGTCCATGA 46093
.......... .......... .......... .......... .......... .......... 143
AGGCGGCGCC GGCGCCGGCG ACGCGCAAAT GGGGACGTAC AAGTGCTGCA TCTTCTTCAC 46153
|| || ||| |||||| ||||||||| || ||||||
.......... .......... ....GC-GAT GGGCACGTAC AAGTGCTGCC TCATCTTCAA 178
CCGCAGGTTC GCGCT-GAGC GACGCGTCCA CGCCGGGCGA CGTGCGCATG CTGTTCACCC 46212
||| | ||| |||| || | |||||| | |||| | ||| || ||| || ||| ||
GCGCCGCTTC -CGCTGGAAC GACGCGCCGC CGCCCGACGA TGTCCGCGCC CTCTTCGCCA 237
GCCACGCCGG CGGC-GC-GC CCT-ACATGG GCATCGACGA GCTCCGGCGC TACCTCGCCG 46269
|||| ||| |||| | || || |||||| | ||||| ||||| | |||||| |
ACCACTCCGC CGGCGGTGGC CCCCACATGG CCGCCGACGG CCTCCGCGCC TACCTCCAGG 297
CCAGCGGGGA GGCCCACGTC GACGCCGACA CGGCGGAGC- GGATCATC-G A-CCGGGTCC 46326
||| | || | | ||| | |||| |||| || | || || | | | | |||| |
CCA-CCGGCA CGAGGACGGC GACGTGGACA TGGAGCGGCT GGTGGAGCAG ATCCGGCAGC 356
TGCAGGAGCG CAGC---CGC ACCCCGC-GC T---TCGGG- AAGCCGTC-G CT-CACCATC 46376
|||||| ||| | || ||| | ||||| | | |||| | || | || ||| |
TGCAGGGGCG CGGCGGGCGC ATCCCGCGGG TGGGGCGGGC ACTCCCACTC CTGCACGGTG 416
GACGATTTCC AGTACTTCCT CTTCTCCGAG GACCTCAACC CGCCCATCTG CCATTCCAAG 46436
||||| |||| | ||||| ||||||| | || || |||| | |||||| | || ||
GACGACTTCC ACCGATTCCT CTTCTCCCAC GAGCTGAACC CACCCATCCG GCACGGGCAG 476
GAAGTAAGCA AACTACCCGC TCGATCCCCA ATTTCCCAAA TGCTGTTAGA TTCATCGTCA 46496
|
GGG....... .......... .......... .......... .......... .......... 479
TTCCGTGATA ATCCTGCCGT TGCACAATGC GGTGAAATGG CGTAATTTGC TAGGATTCAG 46556
.......... .......... .......... .......... .......... .......... 479
AAGGGGATTC TTGGGGTTTG TTTAGTTCAC ATTAAAATTA AAAGTTTGGT TAAAATTGGA 46616
.......... .......... .......... .......... .......... .......... 479
ATGATGTGAC GAAAAGTTAG AAGTTTGTGT GTGCAGGAAA GTTTTGATGC GATGGAAAAG 46676
.......... .......... .......... .......... .......... .......... 479
TTGGAAGTTT GAAGAAAAAA ATTAAAACTA AACATGGCTT TGGTCGGAAC TGCTCTGTAG 46736
.......... .......... .......... .......... .......... .......... 479
TGTGGACGTC ATTCAAATCT TTATGAAGTA TTTTTTTAAA GATGGATCAC ACATGTGATT 46796
.......... .......... .......... .......... .......... .......... 479
AACATAGTTA TATAAAATTT TGTTAAAATT TGAAAATGTA GAATACGATG ATATAAATCA 46856
.......... .......... .......... .......... .......... .......... 479
CTATATAAAC ATGCAAGTTT AAATTTGATC CACGCAAAGA GAAAAAATAT AACCGATTAT 46916
.......... .......... .......... .......... .......... .......... 479
GTTTGAGTTG TGGCATTACT ATTTTCTATC TGGTTCTATT AATTTTTTTT CTCCAATTGT 46976
.......... .......... .......... .......... .......... .......... 479
AGATCGAATC AAGCCTTTGT ATGTTTGTAC ATAGACTTAT GCTATCGTAA TCTACTCCCA 47036
.......... .......... .......... .......... .......... .......... 479
TTTTTTTTGG ACGGAGGGAG TATGTTATCA ATTTTAGTTT AATTTTTTTT ACAACTATTT 47096
.......... .......... .......... .......... .......... .......... 479
GGGTCACATA CAAATAACTG GCACATATGC ACCTAGGTGT AAAGAAGTCA ACATGCAGGT 47156
.......... .......... .......... .......... .......... .......... 479
AATGAATTGA ATTTCCATAC AACATTCTGC TCTCCTAAGA AATTACGCTT ACAAGTTCAC 47216
.......... .......... .......... .......... .......... .......... 479
TTGGATATTG CTAAACTCCA TTTTGATATT ACTTAGTGTG TACTGAATGA TCTAAGATGT 47276
.......... .......... .......... .......... .......... .......... 479
GAGTTGATGG TAGATCTCGT GCTCTCAGGT CCATCACGAC ATGAATGCAC CATTATCGCA 47336
||||| || |||||| ||| || | | | || ||
.......... .......... .....CAGGT GCACCACGAC ATGGCCGCCC CGCTCTCCCA 514
CTACTTCATA TACACTGGAC ACAACTCGTA TCTGACGGGC AATCAACTTA GCAGTGACTG 47396
||||||||| ||||| || | ||||||| || || || ||| || || || | |||| |||||
CTACTTCATC TACACCGGCC ACAACTCCTA CCTCACCGGC AACCAGCTCA GCAGCGACTG 574
CAGTGATATT CCCATCATTA AGGCACTGCA AATAGGCGTC CGTGTAATTG AACTGGACAT 47456
||| || | |||||||| | ||| || || | |||||| || || || | | || |||||
CAGCGACCTC CCCATCATCA GGGCTCTCCA GAGGGGCGTC CGCGTCATCG AGCTCGACAT 634
GTGGCCAAAT TCTTCTAAAG ATGATGTTGA TATTCTCCAT GGAAGGTATG CATGAGAATT 47516
|||||| || || || || | |||| | || |||||| || ||
GTGGCCCAAC TCCTCCAAGG ATGACATCAG CATCCTCCAT GGCAG..... .......... 679
GCTCACTTGA AGACATTTTT GTTCTGCACT GGAGGCCATT CGATATGCTA TGACCTTATT 47576
.......... .......... .......... .......... .......... .......... 679
CCAAACTATT TGCTTCTTTG GTAGGACACT GACTGCCCCA GTATCACTTA TCAAATGCTT 47636
||| | || |||| || || || |||||||| |
.......... .......... ....GACGCC CACCACCCCG GTCTCCCTCC TCAAATGCCT 715
GAAATCCATC AAAGAATATG CCTTTGTTGC GTCTCCCTAC CCTGTTATTA TAACATTAGA 47696
|||||| || ||| | | |||||| || || || ||| || ||||| | | || | ||
CCTCTCCATC AAGGAACACG CCTTTGAGGC CTCCCCTTAC CCGGTTATCA TCACGCTCGA 775
AGACCACCTT ACATCTGATC TTCAGGCGAA AGTAGCTAAG GTAATTGCAT TTTCCTCGTA 47756
||||||||| || | |||| | |||| || || ||| |||
AGACCACCTC ACCCCCGATC TCCAGGACAA AGCAGCCAAG .......... .......... 815
TGATCAATAA TTTGGTGCAG TTGATTCTGT TGTAGCTAGT TATGAAATTT TCTTTAGATG 47816
|||
.......... .......... .......... .......... .......... .......ATG 818
GTTCTTGAAG TATTTGGAGA TACCCTATAT TATCCCGAGT CAAAACATCT TCAAGAATTT 47876
|||||||||| | || || || | ||| || || || || | | || || |||| ||
GTTCTTGAAG TTTTCGGCGA CATCCTCTAC TACCCTGACA AAGATCACCT CAAAGAGTTC 878
CCTTCACCCG AAGCACTGAG GGGACGTGTC ATCCTCTCAA CAAAACCCCC AAAGGAGTAC 47936
||||| || ||| || | ||| |||||| ||||||| | | || ||||| |||||||||
CCTTCGCCTC AAGACCTCAA GGGCCGTGTC CTCCTCTCCA CCAAGCCCCC CAAGGAGTAC 938
CTTGAATCAA AAGGTGGTAC TATGAAAGAC AGAGACATTG AGCCTCAGTT TAGCAAAGGA 47996
||| || | | | | ||||| ||| | | | || | || | | || |
CTTCAAGCCA AGGATGGTA- -ATG-CTGCC ACCATCAAAG AG-GAC-GCC AAG-GCCGCC 992
CAAAATGAAG AAGCTGTCTG GGGAACAGAA GTCCCAGATA TTCAGGATGA GATGCAAACC 48056
| ||| | | || | || ||||| |||| |||||||||| |||| | | || || |
GCCACTGACG ATGCCGCATG GGGAAAAGAA GTCCCAGATA TTCACTCTCA AATCCACTCT 1052
GCCGACAAGG TTCTACTGGT TTTAACATTT GTTGTTTCTT GTTTCTTAGC ATATGGTGTA 48116
||| || | ||
GCC-ACTAAA CATGACCA.. .......... .......... .......... .......... 1069
TGTCCATCAC TGTTGTATTG GCTTTATTCC CTAGCAGCAT GAGAATGATA TACTATACAC 48176
|| | || |||| || | ||
.......... .......... .......... ....AAG-A- GAAGATGA-C GACGACACCG 1092
CCAAAGA-GA TGTGGAAGAA GATGATGAGA AGAAAATGTG CCAGCATCAC CCACTAGAGT 48235
|||| || | || || || || ||| |||||||| ||||||| | | | |||
ATGAAGACGA AGATGACGAG GAGGAGGAGC AGAAAATGCA ACAGCATCTA GCTCCACAGT 1152
ATAAACACCT TATTACTATT AAGGCAGGAA AGCCAAAGGG TGCTGTAGTT GATGCCTTAA 48295
| |||||||| |||||| || || ||||||| ||||||| || | || || || |||||||||
ACAAACACCT TATTACCATC AAAGCAGGAA AGCCAAAAGG TACTCTACTT GATGCCTTAC 1212
AGGGTGATCC AGATAAAGTT AGACGCCTCA GTTTGAGTGA GCAGGAACTT GCAAAAGTGG 48355
|| |||| || ||| || ||| ||| | |||| ||||||| || ||| ||||| || ||| |||
AGAGTGACCC AGAAAAGGTT AGAAGGCTCA GTTTGAGCGA GCAACAACTT GCCAAATTGG 1272
CAGCGCATCA TGGTCGTAAC ATCGTGAGGT TCGTTTAGCA AATATACTGA ATTTCGTAGC 48415
||| ||||| |||| | || || ||
CAGATCATCA TGGTACCGAA ATTGTCAG.. .......... .......... .......... 1300
AAAGTATTTT CTATCATTGC ACCAGAGCTC TCTATGTCCA TTGACCTTAA CTTCATTCTG 48475
.......... .......... .......... .......... .......... .......... 1300
TTTATTCAAA GCAGCTTTAC ACATAAAAAT CTTCTGAGAA TATACCCAAA GGGCACTCGC 48535
|| || ||| | ||| || ||||| | |||||||||| |||||||||
.......... ....GTTCAC ACAGAGAAAC CTACTGAGGA TATACCCAAA GGGCACTCGG 1346
TTCAATTCTT CGAACTATAA TCCGTTTCTT GGTTGGGTGC ATGGTGCACA AATGGTGGCA 48595
||| || | | |||||||| ||| |||||| |||||||||| ||||||| || ||||| ||
GTCACATCAT CCAACTATAA TCCATTTCTT GGTTGGGTGC ATGGTGCTCA GATGGTAGCG 1406
TTTAATATGC AGGTACATTT CTAACATGAC ACTCCTCTGC TACATCATAT TGGCCTGAAT 48655
|| ||||||| ||
TTCAATATGC AG........ .......... .......... .......... .......... 1418
GCCTGATACA TTTTTCTTCG CAGGGGTATG GAAGATCTCT TTGGCTAATG CACGGATTCT 48715
|| |||| ||||| |||| |||| | ||| || ||||| |
.......... .......... ...GGATATG GAAGAGCTCT TTGGTTGATG CATGGATTTT 1455
ACAAGGCCAA CGGTGGCTGC GGTTATGTGA AGAAGCCAGA TTTCATGATG CAAACTTGTC 48775
| || || || |||||||| |||||||||| |||| ||||| |||| | ||| |||||| ||
ATAAAGCTAA TGGTGGCTGT GGTTATGTGA AGAAACCAGA TTTCTTAATG CAAACTGATC 1515
CAGATGGAAA TGTTTTTGAC CCGAAAGCAG ATTTACCTGT GAAGAAAACA CTCAAGGTAG 48835
|||| | ||||||||| || ||| | || || |||||||| | |||
CAGA--G--- -GTTTTTGAC CCAAAAAAAT CCCTATCTTC CAAGAAAACC TTGAAG.... 1565
GTTTGTGGCA TATGTTTCTT CCTTTCATTT TCATCTCTGA AATTCAGGAA TCGAGCTACT 48895
.......... .......... .......... .......... .......... .......... 1565
TACAGCTTGC CTGTTTGTCT ACCAGGTCAA AGTATACATG GGCGAAGGTT GGCAGAGCGA 48955
|| || |||||||||| || || |||| ||| || ||
.......... .......... .....GTGAA AGTATACATG GGGGATGGTT GGCGGATGGA 1600
CTTCAAGCAG ACATACTTCG ACACGTATTC CCCTCCAGAC TTCTACGCAA AGGTACATCG 49015
||||| |||| || |||| | | ||||| ||||||||| || || ||| |
CTTCACGCAG ACCCACTTTG ATCAATATTC TCCTCCAGAC TTTTATGCAC GG........ 1652
AATTTTACGC TGATGCCAAA CGCCAACAAA TTTGCAAATG CAAAACGGAG CTTTGAAAAA 49075
.......... .......... .......... .......... .......... .......... 1652
ACATGTATAT ATGTATAACT TTTACATATG GAGTGAGATG AAGACAAACT TTATATCAAA 49135
.......... .......... .......... .......... .......... .......... 1652
ATTGTAGAGC TCCATGAGTT CTACGACGTT CTTATTGACT AGTCCATCGT TCCATCATCA 49195
.......... .......... .......... .......... .......... .......... 1652
TAACAGGTGG GCATTGCCGG GGTTCCGTCG GACTCGGTGA TGCAGAAGAC GAAAGCCGTG 49255
|||| | || || || || || || |||||||||| || ||| || || || |
......GTGG GGATAGCGGG AGTACCAGCG GACTCGGTGA TGAAGAGAAC GAGGGCGATA 1706
GAGGACAGCT GGGTTCCCGT GTGGGAGGAG GAGTTCGTGT TCCCGCTGAC CGTCCCGGAG 49315
||||| | || |||| || || |||||||||| || ||| | || ||||| ||| ||||||
GAGGATAACT GGGTGCCGGT GTGGGAGGAG GATTTCACCT TCAAACTGAC CGTGCCGGAG 1766
ATCGCGCTGC TCCGCGTGGA GGTGCACGAG TACGAC--GT GAGCG-AGGA CGACTTCGGC 49372
|||||| ||| | || ||||| |||||||||| |||||| || | | |||| ||||||||||
ATCGCGTTGC TGCGGGTGGA GGTGCACGAG TACGACATGT CGGAGAAGGA CGACTTCGGC 1826
GGGCAGACGG CGCTCCCGGT GTCGGAGCTG CGGCCGGGGA TCCGCACCGT GCCGCTCTTC 49432
|| ||||||| ||| ||||| ||||||||| ||||||| |||| | || | | ||| |
GGCCAGACGG TGCTGCCGGT GTCGGAGCTC ATCCCGGGGA TCCGAGCGGT GGCACTCCAC 1886
GACCACAAGG GGCTCAAGTT CAAGAGCGTC AAGCTCCTCA TGCGGTTCGA GTTCGTCT-A 49491
|||| ||| | || ||||||| || | |||| ||||| |||| |||| ||||| ||| | | |
GACCGCAAAG GGATCAAGTT GAACAACGTC AAGCTTCTCA TGCGCTTCGA GTTTGAATGA 1946
GCAAATTCA 49500
| || ||
CCCAACACA 1955
hqPGS_AC137075+_12698877+ (47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49500)
<A NAME="PGS6@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 6 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=29680389" TARGET="NUCLEOTIDE SEQUENCE SEARCH">29680389+</A>)
1 CAGATCCGGC AGCCTTTTGG GCGCGGCGGG CGCATCCCGC GGGTGGGGCG GGCACTCCCA
61 CTCCTGACGG TGGACGACTT CCACCGATTC CTCTTCTCCC ACGAGCTGAA CCCACCCATC
121 CGGCACGGGC AGGGGCAGGT GCACCACGAC ATGGCCGCCC CGCTCTCCCA CTACTTCATC
181 TACACCGGCC ACAACTCCTA CCTCACCGGC AACCAGCTCA GCAGCGACTG CAGCGACCTC
241 CCCATCATCA GGGCTCTCCA GAGGGGCGTC CGCGTCATCG AGCTCGACAT GTGGCCCAAC
301 TCCTCCAAGG ATGACATCAG CATCCTCCAT GGCAGGACGC TCACCACCCC GGTCTCCCTC
361 CTCAAATGCC TCCTCTCCAT CAAGCAACAC GCCTTTGAGG CCTCCCCTTA CCCGGTTATC
421 ATCACGCTCG AAGACCACCT CACCCCCGAT CTCCAGGACA AAGCAGCCAA GATGGTTCTT
481 GAAGTTTTCG GCGACATCCT CTACTACCCT GACAAAGATC ACCTCAAAGA GTTCCCTTCG
541 CCTCAAGACC TCAAGGGCCG TGTCCTCCTC TCCACCAAGC CCCCCAGGGA GTACCTTCAA
601 GCCAAGGATG GTAATGCTGC CACCATCAAA GAGGACGCCA AGGCCGCCGC CACTGACGAT
661 GCCGCATGGG GAAAAGAAGT CCCAGATATT CACTCTCAAA TCCACTCTGC CACTAAACAT
721 GACCACAGAG AAGATGACGA CGACACCGAT
Predicted gene structure (within gDNA segment 45167 to 48635):
Exon 1 46380 46439 ( 60 n); cDNA 76 135 ( 60 n); score: 0.683
Intron 1 46440 47301 ( 862 n); Pd: 0.852 (s: 0.68), Pa: 0.000 (s: 0.78)
Exon 2 47302 47501 ( 200 n); cDNA 136 335 ( 200 n); score: 0.750
Intron 2 47502 47600 ( 99 n); Pd: 0.981 (s: 0.74), Pa: 0.943 (s: 0.68)
Exon 3 47601 47736 ( 136 n); cDNA 336 471 ( 136 n); score: 0.721
Intron 3 47737 47813 ( 77 n); Pd: 0.967 (s: 0.74), Pa: 0.462 (s: 0.72)
Exon 4 47814 47955 ( 142 n); cDNA 472 613 ( 142 n); score: 0.732
MATCH AC137075+ 29680389+ 0.730 538 0.717 C
PGS_AC137075+_29680389+ (46380 46439,47302 47501,47601 47736,47814 47955)
Alignment (genomic DNA sequence = upper lines):
GATTTCCAGT ACTTCCTCTT CTCCGAGGAC CTCAACCCGC CCATCTGCCA TTCCAAGGAA 46439
|| ||||| |||||||| |||| | || || ||||| | ||||| | || |||
GACTTCCACC GATTCCTCTT CTCCCACGAG CTGAACCCAC CCATCCGGCA CGGGCAGGGG 135
GTAAGCAAAC TACCCGCTCG ATCCCCAATT TCCCAAATGC TGTTAGATTC ATCGTCATTC 46499
.......... .......... .......... .......... .......... .......... 135
CGTGATAATC CTGCCGTTGC ACAATGCGGT GAAATGGCGT AATTTGCTAG GATTCAGAAG 46559
.......... .......... .......... .......... .......... .......... 135
GGGATTCTTG GGGTTTGTTT AGTTCACATT AAAATTAAAA GTTTGGTTAA AATTGGAATG 46619
.......... .......... .......... .......... .......... .......... 135
ATGTGACGAA AAGTTAGAAG TTTGTGTGTG CAGGAAAGTT TTGATGCGAT GGAAAAGTTG 46679
.......... .......... .......... .......... .......... .......... 135
GAAGTTTGAA GAAAAAAATT AAAACTAAAC ATGGCTTTGG TCGGAACTGC TCTGTAGTGT 46739
.......... .......... .......... .......... .......... .......... 135
GGACGTCATT CAAATCTTTA TGAAGTATTT TTTTAAAGAT GGATCACACA TGTGATTAAC 46799
.......... .......... .......... .......... .......... .......... 135
ATAGTTATAT AAAATTTTGT TAAAATTTGA AAATGTAGAA TACGATGATA TAAATCACTA 46859
.......... .......... .......... .......... .......... .......... 135
TATAAACATG CAAGTTTAAA TTTGATCCAC GCAAAGAGAA AAAATATAAC CGATTATGTT 46919
.......... .......... .......... .......... .......... .......... 135
TGAGTTGTGG CATTACTATT TTCTATCTGG TTCTATTAAT TTTTTTTCTC CAATTGTAGA 46979
.......... .......... .......... .......... .......... .......... 135
TCGAATCAAG CCTTTGTATG TTTGTACATA GACTTATGCT ATCGTAATCT ACTCCCATTT 47039
.......... .......... .......... .......... .......... .......... 135
TTTTTGGACG GAGGGAGTAT GTTATCAATT TTAGTTTAAT TTTTTTTACA ACTATTTGGG 47099
.......... .......... .......... .......... .......... .......... 135
TCACATACAA ATAACTGGCA CATATGCACC TAGGTGTAAA GAAGTCAACA TGCAGGTAAT 47159
.......... .......... .......... .......... .......... .......... 135
GAATTGAATT TCCATACAAC ATTCTGCTCT CCTAAGAAAT TACGCTTACA AGTTCACTTG 47219
.......... .......... .......... .......... .......... .......... 135
GATATTGCTA AACTCCATTT TGATATTACT TAGTGTGTAC TGAATGATCT AAGATGTGAG 47279
.......... .......... .......... .......... .......... .......... 135
TTGATGGTAG ATCTCGTGCT CTCAGGTCCA TCACGACATG AATGCACCAT TATCGCACTA 47339
||||| || ||||||||| || || | || |||||
.......... .......... ..CAGGTGCA CCACGACATG GCCGCCCCGC TCTCCCACTA 173
CTTCATATAC ACTGGACACA ACTCGTATCT GACGGGCAAT CAACTTAGCA GTGACTGCAG 47399
|||||| ||| || || |||| |||| || || || ||||| || || |||| | ||||||||
CTTCATCTAC ACCGGCCACA ACTCCTACCT CACCGGCAAC CAGCTCAGCA GCGACTGCAG 233
TGATATTCCC ATCATTAAGG CACTGCAAAT AGGCGTCCGT GTAATTGAAC TGGACATGTG 47459
|| | ||| ||||| | || | || || | |||||||| || || || | | ||||||||
CGACCTCCCC ATCATCAGGG CTCTCCAGAG GGGCGTCCGC GTCATCGAGC TCGACATGTG 293
GCCAAATTCT TCTAAAGATG ATGTTGATAT TCTCCATGGA AGGTATGCAT GAGAATTGCT 47519
||| || || || || |||| | | || |||||||| ||
GCCCAACTCC TCCAAGGATG ACATCAGCAT CCTCCATGGC AG........ .......... 335
CACTTGAAGA CATTTTTGTT CTGCACTGGA GGCCATTCGA TATGCTATGA CCTTATTCCA 47579
.......... .......... .......... .......... .......... .......... 335
AACTATTTGC TTCTTTGGTA GGACACTGAC TGCCCCAGTA TCACTTATCA AATGCTTGAA 47639
||| || || |||| || || || ||| ||||| |
.......... .......... .GACGCTCAC CACCCCGGTC TCCCTCCTCA AATGCCTCCT 374
ATCCATCAAA GAATATGCCT TTGTTGCGTC TCCCTACCCT GTTATTATAA CATTAGAAGA 47699
|||||||| || | |||| ||| || || || ||||| ||||| || | | | |||||
CTCCATCAAG CAACACGCCT TTGAGGCCTC CCCTTACCCG GTTATCATCA CGCTCGAAGA 434
CCACCTTACA TCTGATCTTC AGGCGAAAGT AGCTAAGGTA ATTGCATTTT CCTCGTATGA 47759
|||||| || | ||||| | ||| |||| ||| |||
CCACCTCACC CCCGATCTCC AGGACAAAGC AGCCAAG... .......... .......... 471
TCAATAATTT GGTGCAGTTG ATTCTGTTGT AGCTAGTTAT GAAATTTTCT TTAGATGGTT 47819
||||||
.......... .......... .......... .......... .......... ....ATGGTT 477
CTTGAAGTAT TTGGAGATAC CCTATATTAT CCCGAGTCAA AACATCTTCA AGAATTTCCT 47879
|||||||| | | || || | ||| || || || || | | || || | ||| || |||
CTTGAAGTTT TCGGCGACAT CCTCTACTAC CCTGACAAAG ATCACCTCAA AGAGTTCCCT 537
TCACCCGAAG CACTGAGGGG ACGTGTCATC CTCTCAACAA AACCCCCAAA GGAGTACCTT 47939
|| || ||| || | ||| |||||| || ||||| || | | ||||| | ||||||||||
TCGCCTCAAG ACCTCAAGGG CCGTGTCCTC CTCTCCACCA AGCCCCCCAG GGAGTACCTT 597
GAATCAAAAG GTGGTA 47955
|| | || | |||||
CAAGCCAAGG ATGGTA 613
hqPGS_AC137075+_29680389+ (47302 47501,47601 47736,47814 47955)
<A NAME="PGS2@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 2 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=27578307" TARGET="NUCLEOTIDE SEQUENCE SEARCH">27578307+</A>)
1 CCGCCACTGA CGATGCCGAG GGGGGAAAAG AAGTCCCAGA TATTCACTCT CAAATCCACT
61 CTGCCACTAA ACATGACCAA AGAGAAGATG ACGACGACAC CGATGAAGAC GAAGATGACG
121 AGGAGGAGGA GCAGAAAATG CAACAGCATC TAGCTCCACA GTACAAACAC CTTATTACCA
181 TCAAAGCAGG AAAGCCAAAA GGTACTCTAC TTGATGCCTT ACAGAGTGAC CCAGAAAAGG
241 TTAGAAGGCT CAGTTTGAGC GAGCAACAAC TTGCCAAATT GGCAGATCAT CATGGTACCG
301 AAATTGTAAG GTTCACACAG AGAAACCTAC TGAGGATATA CCCAAAGGGC ACTCGGGTCA
361 CATCATCCAA CTATAATCCA TTTCTTGGTT GGGTGCATGG TGCTCAGATG GTAGCGTTCA
421 ATATGCAGGG ATATGGAAGA GCTCTTTGGT TGATGCATGG ATTTTATAAA GCTAATGGTG
481 GCTGTGGTTA TGTGAAGAAA CCAGATTTCT TAATGCAAAC TGATCCAGAG GTTTTTGACC
541 CAAAAAAATC CCTATCTCCC AAGAAAACCT TGAAGGTGAA AGTATACATG GGGGATGGTT
601 GGCGGATGGA CTTCACGCAG ACCCACTTTG ATCAATATTC TCCTCCAGAC TTTTATGCAC
661 GGGTGGGGAT AGCGGGAGTA CCAGCGGA
Predicted gene structure (within gDNA segment 47583 to 49722):
Exon 1 48157 48383 ( 227 n); cDNA 84 310 ( 227 n); score: 0.736
Intron 1 48384 48489 ( 106 n); Pd: 0.981 (s: 0.74), Pa: 0.477 (s: 0.82)
Exon 2 48490 48607 ( 118 n); cDNA 311 428 ( 118 n); score: 0.839
Intron 2 48608 48678 ( 71 n); Pd: 0.976 (s: 0.88), Pa: 0.976 (s: 0.80)
Exon 3 48679 48831 ( 153 n); cDNA 429 575 ( 147 n); score: 0.765
Intron 3 48832 48920 ( 89 n); Pd: 0.988 (s: 0.64), Pa: 0.739 (s: 0.82)
Exon 4 48921 49007 ( 87 n); cDNA 576 662 ( 87 n); score: 0.782
Intron 4 49008 49201 ( 194 n); Pd: 0.601 (s: 0.74), Pa: 0.941 (s: 0)
Exon 5 49202 49227 ( 26 n); cDNA 663 688 ( 26 n); score: 0.731
MATCH AC137075+ 27578307+ 0.771 611 0.888 C
PGS_AC137075+_27578307+ (48157 48383,48490 48607,48679 48831,48921 49007,49202 49227)
Alignment (genomic DNA sequence = upper lines):
GAGAATGATA TACTATACAC CCAAAGA-GA TGTGGAAGAA GATGATGAGA AGAAAATGTG 48215
|| |||| || | || |||| || | || || || || ||| ||||||||
GAAGATGA-C GACGACACCG ATGAAGACGA AGATGACGAG GAGGAGGAGC AGAAAATGCA 142
CCAGCATCAC CCACTAGAGT ATAAACACCT TATTACTATT AAGGCAGGAA AGCCAAAGGG 48275
||||||| | | | ||| | |||||||| |||||| || || ||||||| ||||||| ||
ACAGCATCTA GCTCCACAGT ACAAACACCT TATTACCATC AAAGCAGGAA AGCCAAAAGG 202
TGCTGTAGTT GATGCCTTAA AGGGTGATCC AGATAAAGTT AGACGCCTCA GTTTGAGTGA 48335
| || || || ||||||||| || |||| || ||| || ||| ||| | |||| ||||||| ||
TACTCTACTT GATGCCTTAC AGAGTGACCC AGAAAAGGTT AGAAGGCTCA GTTTGAGCGA 262
GCAGGAACTT GCAAAAGTGG CAGCGCATCA TGGTCGTAAC ATCGTGAGGT TCGTTTAGCA 48395
||| ||||| || ||| ||| ||| ||||| |||| | || || ||
GCAACAACTT GCCAAATTGG CAGATCATCA TGGTACCGAA ATTGTAAG.. .......... 310
AATATACTGA ATTTCGTAGC AAAGTATTTT CTATCATTGC ACCAGAGCTC TCTATGTCCA 48455
.......... .......... .......... .......... .......... .......... 310
TTGACCTTAA CTTCATTCTG TTTATTCAAA GCAGCTTTAC ACATAAAAAT CTTCTGAGAA 48515
|| || ||| | ||| || ||||| |
.......... .......... .......... ....GTTCAC ACAGAGAAAC CTACTGAGGA 336
TATACCCAAA GGGCACTCGC TTCAATTCTT CGAACTATAA TCCGTTTCTT GGTTGGGTGC 48575
|||||||||| ||||||||| ||| || | | |||||||| ||| |||||| ||||||||||
TATACCCAAA GGGCACTCGG GTCACATCAT CCAACTATAA TCCATTTCTT GGTTGGGTGC 396
ATGGTGCACA AATGGTGGCA TTTAATATGC AGGTACATTT CTAACATGAC ACTCCTCTGC 48635
||||||| || ||||| || || ||||||| ||
ATGGTGCTCA GATGGTAGCG TTCAATATGC AG........ .......... .......... 428
TACATCATAT TGGCCTGAAT GCCTGATACA TTTTTCTTCG CAGGGGTATG GAAGATCTCT 48695
|| |||| ||||| ||||
.......... .......... .......... .......... ...GGATATG GAAGAGCTCT 445
TTGGCTAATG CACGGATTCT ACAAGGCCAA CGGTGGCTGC GGTTATGTGA AGAAGCCAGA 48755
|||| | ||| || ||||| | | || || || |||||||| |||||||||| |||| |||||
TTGGTTGATG CATGGATTTT ATAAAGCTAA TGGTGGCTGT GGTTATGTGA AGAAACCAGA 505
TTTCATGATG CAAACTTGTC CAGATGGAAA TGTTTTTGAC CCGAAAGCAG ATTTACCTGT 48815
|||| | ||| |||||| || |||| | ||||||||| || ||| | || ||
TTTCTTAATG CAAACTGATC CAGA--G--- -GTTTTTGAC CCAAAAAAAT CCCTATCTCC 559
GAAGAAAACA CTCAAGGTAG GTTTGTGGCA TATGTTTCTT CCTTTCATTT TCATCTCTGA 48875
|||||||| | |||
CAAGAAAACC TTGAAG.... .......... .......... .......... .......... 575
AATTCAGGAA TCGAGCTACT TACAGCTTGC CTGTTTGTCT ACCAGGTCAA AGTATACATG 48935
|| || ||||||||||
.......... .......... .......... .......... .....GTGAA AGTATACATG 590
GGCGAAGGTT GGCAGAGCGA CTTCAAGCAG ACATACTTCG ACACGTATTC CCCTCCAGAC 48995
|| || |||| ||| || || ||||| |||| || |||| | | ||||| |||||||||
GGGGATGGTT GGCGGATGGA CTTCACGCAG ACCCACTTTG ATCAATATTC TCCTCCAGAC 650
TTCTACGCAA AGGTACATCG AATTTTACGC TGATGCCAAA CGCCAACAAA TTTGCAAATG 49055
|| || ||| |
TTTTATGCAC GG........ .......... .......... .......... .......... 662
CAAAACGGAG CTTTGAAAAA ACATGTATAT ATGTATAACT TTTACATATG GAGTGAGATG 49115
.......... .......... .......... .......... .......... .......... 662
AAGACAAACT TTATATCAAA ATTGTAGAGC TCCATGAGTT CTACGACGTT CTTATTGACT 49175
.......... .......... .......... .......... .......... .......... 662
AGTCCATCGT TCCATCATCA TAACAGGTGG GCATTGCCGG GGTTCCGTCG GA 49227
|||| | || || || || || || ||
.......... .......... ......GTGG GGATAGCGGG AGTACCAGCG GA 688
hqPGS_AC137075+_27578307+ (48157 48383,48490 48607,48679 48831,48921 49007,49202 49227)
<A NAME="PGS3@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 3 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=32971490" TARGET="NUCLEOTIDE SEQUENCE SEARCH">32971490+</A>)
1 CACCGATGAA GACGAAGATG ACGAGGAGGA GGAGCAGAAA ATGCAACAGC ATCTAGCTCC
61 ACAGTACAAA CACCTTATTA CCATCAAAGC AGGAAAGCCA AAAGGTACTC TACTTGATGC
121 CTTACAGAGT GACCCAGAAA AGGTTAGAAG GCTCAGTTTG AGCGAGCAAC AACTTGCCAA
181 ATTGGCAGAT CATCATGGTA CCGAAATTGT AAGGTTCACA CAGAGAAACC TACTGAGGAT
241 ATACCCAAAG GGCACTCGGG TCACATCATC CAACTATAAT CCATTTCTTG GTTGGGTGCA
301 TGGTGCTCAG ATGGTAGCGT TCAATATGCA GGGATATGGA AGAGCTCTTT GGTTGATGCA
361 TGGATTTTAT AAAGCTAATG GTGGCTGTGG TTATGTGAAG AAACCAGATT TCTTAATGCA
421 AACTGATCCA GAGGTTTTTG ACCCAAAAAA AATCCCTATC TCCCAAGAAA ACCTTGAAGG
481 TGAAAGTATA CATGGGGGAT GGTTGGCGGA TGGACTTCAC GCAGACCCAC TTTGATCAAT
541 ATTCTCCTCC AGACTTTTAT GCACGGGTGG GGATAGCGGG AGTACCAGCG GACTCGGTGA
601 TGAAGAGAAC GAGGGCGATA GAGGATAACT GGGTGCCGGT GTGGGAGGAG GATTTCACCT
661 TCAAACTGAC CGTGCCGGAG ATCGCGTTGC TGCGGGTGGA GGTGCACGAG TACGACATGT
721 CGGAGAAGGA CGACTTCGGC GGCCAGACGG TGCTGCCGGT GTCGGATCTC ATCCCGGGGA
781 TCCGAGCGGT GGCACTCCAC GACCGCAAAG GGATCAAGTT GAACAACGTC AAGCTTCTCA
841 TGCGCTTCGA GTTTGAATGA CCCAACACAC CGACACTTTC TTTCTTTCTC GCCGCATCGC
901 ATTGCACTGT GCCTGTGCTT GTGCAGCATC CATCATTTGG TTTGGTTTTT CATGTTCCTG
961 TGCATACGCA TTTGTGTCTG TACATAGGCT CGGTCCTGTA TATTGTTTGT GAGTAACATG
1021 TAATAATAAG GCTTCACGCC ATGTTCATTC CTCCTGTTTG AACCCCTCCA AATATATTAA
1081 TACCGAAGGT ACAAAATATC TC
Predicted gene structure (within gDNA segment 47603 to 51000):
Exon 1 48173 48383 ( 211 n); cDNA 2 213 ( 212 n); score: 0.749
Intron 1 48384 48489 ( 106 n); Pd: 0.981 (s: 0.74), Pa: 0.477 (s: 0.82)
Exon 2 48490 48607 ( 118 n); cDNA 214 331 ( 118 n); score: 0.839
Intron 2 48608 48678 ( 71 n); Pd: 0.976 (s: 0.88), Pa: 0.976 (s: 0.80)
Exon 3 48679 48831 ( 153 n); cDNA 332 479 ( 148 n); score: 0.752
Intron 3 48832 48920 ( 89 n); Pd: 0.988 (s: 0.59), Pa: 0.739 (s: 0.82)
Exon 4 48921 49007 ( 87 n); cDNA 480 566 ( 87 n); score: 0.782
Intron 4 49008 49201 ( 194 n); Pd: 0.601 (s: 0.74), Pa: 0.941 (s: 0.76)
Exon 5 49202 49500 ( 299 n); cDNA 567 869 ( 303 n); score: 0.763
MATCH AC137075+ 32971490+ 0.770 868 0.788 C
PGS_AC137075+_32971490+ (48173 48383,48490 48607,48679 48831,48921 49007,49202 49500)
Alignment (genomic DNA sequence = upper lines):
ACACCCAAAG A-GATGTGGA AGAAGATGAT GAGAAGAAAA TGTGCCAGCA TCACCCACTA 48231
|| ||| | || | || || || || ||| |||||| || ||||| || | | |
ACCGATGAAG ACGAAGATGA CGAGGAGGAG GAGCAGAAAA TGCAACAGCA TCTAGCTCCA 61
GAGTATAAAC ACCTTATTAC TATTAAGGCA GGAAAGCCAA AGGGTGCTGT AGTTGATGCC 48291
|||| |||| |||||||||| || || ||| |||||||||| | ||| || | | ||||||||
CAGTACAAAC ACCTTATTAC CATCAAAGCA GGAAAGCCAA AAGGTACTCT ACTTGATGCC 121
TTAAAGGGTG ATCCAGATAA AGTTAGACGC CTCAGTTTGA GTGAGCAGGA ACTTGCAAAA 48351
||| || ||| | ||||| || |||||| | |||||||||| | ||||| | |||||| |||
TTACAGAGTG ACCCAGAAAA GGTTAGAAGG CTCAGTTTGA GCGAGCAACA ACTTGCCAAA 181
GTGGCAGCGC ATCATGGTCG TAACATCGTG AGGTTCGTTT AGCAAATATA CTGAATTTCG 48411
|||||| | |||||||| | || || ||
TTGGCAGATC ATCATGGTAC CGAAATTGTA AG........ .......... .......... 213
TAGCAAAGTA TTTTCTATCA TTGCACCAGA GCTCTCTATG TCCATTGACC TTAACTTCAT 48471
.......... .......... .......... .......... .......... .......... 213
TCTGTTTATT CAAAGCAGCT TTACACATAA AAATCTTCTG AGAATATACC CAAAGGGCAC 48531
| | ||||| | ||| || ||| || ||||||| ||||||||||
.......... ........GT TCACACAGAG AAACCTACTG AGGATATACC CAAAGGGCAC 255
TCGCTTCAAT TCTTCGAACT ATAATCCGTT TCTTGGTTGG GTGCATGGTG CACAAATGGT 48591
||| ||| || || |||| ||||||| || |||||||||| |||||||||| | || |||||
TCGGGTCACA TCATCCAACT ATAATCCATT TCTTGGTTGG GTGCATGGTG CTCAGATGGT 315
GGCATTTAAT ATGCAGGTAC ATTTCTAACA TGACACTCCT CTGCTACATC ATATTGGCCT 48651
|| || ||| ||||||
AGCGTTCAAT ATGCAG.... .......... .......... .......... .......... 331
GAATGCCTGA TACATTTTTC TTCGCAGGGG TATGGAAGAT CTCTTTGGCT AATGCACGGA 48711
|| ||||||||| |||||||| | ||||| |||
.......... .......... .......GGA TATGGAAGAG CTCTTTGGTT GATGCATGGA 364
TTCTACAAGG CCAACGGTGG CTGCGGTTAT GTGAAGAAGC CAGATTTCAT GATGCAAACT 48771
|| || || | | || ||||| ||| |||||| |||||||| | |||||||| | |||||||||
TTTTATAAAG CTAATGGTGG CTGTGGTTAT GTGAAGAAAC CAGATTTCTT AATGCAAACT 424
TGTCCAGATG GAAATGTTTT TGACCC-GAA AGCAGATTTA CCTGTGAAGA AAACACTCAA 48830
|||||| | ||||| |||||| || | | || || |||| |||| | ||
GATCCAGA-- G----GTTTT TGACCCAAAA AAAATCCCTA TCTCCCAAGA AAACCTTGAA 478
GGTAGGTTTG TGGCATATGT TTCTTCCTTT CATTTTCATC TCTGAAATTC AGGAATCGAG 48890
|
G......... .......... .......... .......... .......... .......... 479
CTACTTACAG CTTGCCTGTT TGTCTACCAG GTCAAAGTAT ACATGGGCGA AGGTTGGCAG 48950
|| ||||||| ||||||| || ||||||| |
.......... .......... .......... GTGAAAGTAT ACATGGGGGA TGGTTGGCGG 509
AGCGACTTCA AGCAGACATA CTTCGACACG TATTCCCCTC CAGACTTCTA CGCAAAGGTA 49010
| ||||||| |||||| | ||| || ||||| |||| ||||||| || ||| |
ATGGACTTCA CGCAGACCCA CTTTGATCAA TATTCTCCTC CAGACTTTTA TGCACGG... 566
CATCGAATTT TACGCTGATG CCAAACGCCA ACAAATTTGC AAATGCAAAA CGGAGCTTTG 49070
.......... .......... .......... .......... .......... .......... 566
AAAAAACATG TATATATGTA TAACTTTTAC ATATGGAGTG AGATGAAGAC AAACTTTATA 49130
.......... .......... .......... .......... .......... .......... 566
TCAAAATTGT AGAGCTCCAT GAGTTCTACG ACGTTCTTAT TGACTAGTCC ATCGTTCCAT 49190
.......... .......... .......... .......... .......... .......... 566
CATCATAACA GGTGGGCATT GCCGGGGTTC CGTCGGACTC GGTGATGCAG AAGACGAAAG 49250
||||| || || || || | | ||||||| ||||||| || | |||| |
.......... .GTGGGGATA GCGGGAGTAC CAGCGGACTC GGTGATGAAG AGAACGAGGG 615
CCGTGGAGGA CAGCTGGGTT CCCGTGTGGG AGGAGGAGTT CGTGTTCCCG CTGACCGTCC 49310
| | ||||| | |||||| || ||||||| ||||||| || | ||| |||||||| |
CGATAGAGGA TAACTGGGTG CCGGTGTGGG AGGAGGATTT CACCTTCAAA CTGACCGTGC 675
CGGAGATCGC GCTGCTCCGC GTGGAGGTGC ACGAGTACGA C--GTGAGCG -AGGACGACT 49367
|||||||||| | |||| || |||||||||| |||||||||| | || | | |||||||||
CGGAGATCGC GTTGCTGCGG GTGGAGGTGC ACGAGTACGA CATGTCGGAG AAGGACGACT 735
TCGGCGGGCA GACGGCGCTC CCGGTGTCGG AGCTGCGGCC GGGGATCCGC ACCGTGCCGC 49427
||||||| || ||||| ||| |||||||||| | || || ||||||||| | ||| | |
TCGGCGGCCA GACGGTGCTG CCGGTGTCGG ATCTCATCCC GGGGATCCGA GCGGTGGCAC 795
TCTTCGACCA CAAGGGGCTC AAGTTCAAGA GCGTCAAGCT CCTCATGCGG TTCGAGTTCG 49487
|| ||||| ||| ||| || ||||| || | ||||||||| |||||||| |||||||| |
TCCACGACCG CAAAGGGATC AAGTTGAACA ACGTCAAGCT TCTCATGCGC TTCGAGTTTG 855
TCT-AGCAAA TTCA 49500
| | | || ||
AATGACCCAA CACA 869
hqPGS_AC137075+_32971490+ (48173 48383,48490 48607,48679 48831,48921 49007,49202 49500)
<A NAME="PGS4@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 4 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=33382000" TARGET="NUCLEOTIDE SEQUENCE SEARCH">33382000+</A>)
1 AAACACCTTA TTACCATCAA AGCAGGAAAG CCAAAAGGTA CTCTACTTGA TGCCTTGCAG
61 AGTGACCCAG AAAAGGTTAG AAGGCTCAGT TTGAGCGAGC AACAACTTGC CAAATTGGCA
121 GATCATCATG GTACCGAAAT TGTCAGGTTC ACACAGAGAA ACCTACTGAG GATATACCCA
181 AAGGGCACTC GGGTCACATC ATCCAACTAT AATCCATTTC TTGGTTGGGT ACATGGTGCT
241 CAGATGGTAG CGTTCAATAT GCAGGGATAT GGAAGAGCTC TTTGGTTGAT GCATGGATTT
301 TATAAAGCTA ATGGTGGCTG TGGTTATGTG AAGAAACCAG ATTTCTTAAT GCAAACTGAT
361 CCAGAGGTTT TTGACCCAAA AAAATCCCTA TCTCCCAAGA AAACCTTGAA GGTGAAAGTA
421 TACATGGGGG ATGGTTGGCG GATGGACTTC ACGCAGACCC ACTTTGATCA ATATTCTCCT
481 CCAGACTTTT ATGCACGGGT GGGGATAGCG GGAGTACNNN CGGACTCGGT GANNNNNNNN
541 ACNNNNNCGA TAGANNATAA CTGGGTGCCG GTGTG
Predicted gene structure (within gDNA segment 47938 to 49755):
Exon 1 48238 48383 ( 146 n); cDNA 1 146 ( 146 n); score: 0.801
Intron 1 48384 48489 ( 106 n); Pd: 0.981 (s: 0.74), Pa: 0.477 (s: 0.82)
Exon 2 48490 48607 ( 118 n); cDNA 147 264 ( 118 n); score: 0.831
Intron 2 48608 48678 ( 71 n); Pd: 0.976 (s: 0.86), Pa: 0.976 (s: 0.80)
Exon 3 48679 48831 ( 153 n); cDNA 265 411 ( 147 n); score: 0.765
Intron 3 48832 48920 ( 89 n); Pd: 0.988 (s: 0.64), Pa: 0.739 (s: 0.82)
Exon 4 48921 49007 ( 87 n); cDNA 412 498 ( 87 n); score: 0.782
Intron 4 49008 49201 ( 194 n); Pd: 0.601 (s: 0.74), Pa: 0.941 (s: 0.58)
Exon 5 49202 49278 ( 77 n); cDNA 499 575 ( 77 n); score: 0.610
MATCH AC137075+ 33382000+ 0.769 581 1.010 C
PGS_AC137075+_33382000+ (48238 48383,48490 48607,48679 48831,48921 49007,49202 49278)
Alignment (genomic DNA sequence = upper lines):
AAACACCTTA TTACTATTAA GGCAGGAAAG CCAAAGGGTG CTGTAGTTGA TGCCTTAAAG 48297
|||||||||| |||| || || ||||||||| ||||| ||| || || |||| |||||| ||
AAACACCTTA TTACCATCAA AGCAGGAAAG CCAAAAGGTA CTCTACTTGA TGCCTTGCAG 60
GGTGATCCAG ATAAAGTTAG ACGCCTCAGT TTGAGTGAGC AGGAACTTGC AAAAGTGGCA 48357
|||| |||| | || ||||| | | |||||| ||||| |||| | ||||||| ||| |||||
AGTGACCCAG AAAAGGTTAG AAGGCTCAGT TTGAGCGAGC AACAACTTGC CAAATTGGCA 120
GCGCATCATG GTCGTAACAT CGTGAGGTTC GTTTAGCAAA TATACTGAAT TTCGTAGCAA 48417
| ||||||| || | || || ||
GATCATCATG GTACCGAAAT TGTCAG.... .......... .......... .......... 146
AGTATTTTCT ATCATTGCAC CAGAGCTCTC TATGTCCATT GACCTTAACT TCATTCTGTT 48477
.......... .......... .......... .......... .......... .......... 146
TATTCAAAGC AGCTTTACAC ATAAAAATCT TCTGAGAATA TACCCAAAGG GCACTCGCTT 48537
|| |||| | | ||| || ||||| ||| |||||||||| ||||||| |
.......... ..GTTCACAC AGAGAAACCT ACTGAGGATA TACCCAAAGG GCACTCGGGT 194
CAATTCTTCG AACTATAATC CGTTTCTTGG TTGGGTGCAT GGTGCACAAA TGGTGGCATT 48597
|| || || |||||||||| | |||||||| |||||| ||| ||||| || | |||| || ||
CACATCATCC AACTATAATC CATTTCTTGG TTGGGTACAT GGTGCTCAGA TGGTAGCGTT 254
TAATATGCAG GTACATTTCT AACATGACAC TCCTCTGCTA CATCATATTG GCCTGAATGC 48657
|||||||||
CAATATGCAG .......... .......... .......... .......... .......... 264
CTGATACATT TTTCTTCGCA GGGGTATGGA AGATCTCTTT GGCTAATGCA CGGATTCTAC 48717
|| |||||| ||| |||||| || | ||||| ||||| ||
.......... .......... .GGATATGGA AGAGCTCTTT GGTTGATGCA TGGATTTTAT 303
AAGGCCAACG GTGGCTGCGG TTATGTGAAG AAGCCAGATT TCATGATGCA AACTTGTCCA 48777
|| || || | ||||||| || |||||||||| || ||||||| || | ||||| |||| ||||
AAAGCTAATG GTGGCTGTGG TTATGTGAAG AAACCAGATT TCTTAATGCA AACTGATCCA 363
GATGGAAATG TTTTTGACCC GAAAGCAGAT TTACCTGTGA AGAAAACACT CAAGGTAGGT 48837
|| | | |||||||||| ||| | || || | ||||||| | |||
GA--G----G TTTTTGACCC AAAAAAATCC CTATCTCCCA AGAAAACCTT GAAG...... 411
TTGTGGCATA TGTTTCTTCC TTTCATTTTC ATCTCTGAAA TTCAGGAATC GAGCTACTTA 48897
.......... .......... .......... .......... .......... .......... 411
CAGCTTGCCT GTTTGTCTAC CAGGTCAAAG TATACATGGG CGAAGGTTGG CAGAGCGACT 48957
|| |||| |||||||||| || |||||| | || ||||
.......... .......... ...GTGAAAG TATACATGGG GGATGGTTGG CGGATGGACT 448
TCAAGCAGAC ATACTTCGAC ACGTATTCCC CTCCAGACTT CTACGCAAAG GTACATCGAA 49017
||| |||||| |||| || ||||| | |||||||||| || ||| |
TCACGCAGAC CCACTTTGAT CAATATTCTC CTCCAGACTT TTATGCACGG .......... 498
TTTTACGCTG ATGCCAAACG CCAACAAATT TGCAAATGCA AAACGGAGCT TTGAAAAAAC 49077
.......... .......... .......... .......... .......... .......... 498
ATGTATATAT GTATAACTTT TACATATGGA GTGAGATGAA GACAAACTTT ATATCAAAAT 49137
.......... .......... .......... .......... .......... .......... 498
TGTAGAGCTC CATGAGTTCT ACGACGTTCT TATTGACTAG TCCATCGTTC CATCATCATA 49197
.......... .......... .......... .......... .......... .......... 498
ACAGGTGGGC ATTGCCGGGG TTCCGTCGGA CTCGGTGATG CAGAAGACGA AAGCCGTGGA 49257
||||| || || || | | | |||| |||||||| || | | ||
....GTGGGG ATAGCGGGAG TACNNNCGGA CTCGGTGANN NNNNNNACNN NNNCGATAGA 554
GGACAGCTGG GTTCCCGTGT G 49278
| | |||| || || |||| |
NNATAACTGG GTGCCGGTGT G 575
hqPGS_AC137075+_33382000+ (48238 48383,48490 48607,48679 48831,48921 49007,49202 49278)
<A NAME="PGS1@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 1 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=14571346" TARGET="NUCLEOTIDE SEQUENCE SEARCH">14571346+</A>)
1 ACCGAAATTG TCAGGTTCAC ACAGAGAAAC CTACTGAGGA TATACCCAAA GGGCACTCGG
61 GTCACATCAT CCAACTATAA TCCATTTCTT GGTTGGGTGC ATGGTGCTCA GATGGTAGCG
121 TTCAATATGC AGGGATATGG AAGAGCTCTT TGGTTGATGC ATGGATTTTA TAAAGCTAAT
181 GGTGGCTGTG GTTATGTGAA GAAACCAGAT TTCTTAATGC AAACTGATCC AGAGGTTTTT
241 GACCCAAAAA AATCCCTATC TTCCAAGAAA ACCTTGAAGG TGAAAGTATA CATGGGGGAT
301 GGTTGGCGGG TGGACTTCAC GCAAGACCCA CTTTGATCAA TATTCTTCTT CAAACTTTTA
361 TGCACGGGTG GGGATAGCGG GAGTACCTCG GCCGCGACCA CGCTA
Predicted gene structure (within gDNA segment 48020 to 49787):
Exon 1 48376 48383 ( 8 n); cDNA 7 14 ( 8 n); score: 0.750
Intron 1 48384 48489 ( 106 n); Pd: 0.981 (s: 0), Pa: 0.477 (s: 0.82)
Exon 2 48490 48607 ( 118 n); cDNA 15 132 ( 118 n); score: 0.839
Intron 2 48608 48678 ( 71 n); Pd: 0.976 (s: 0.88), Pa: 0.976 (s: 0.80)
Exon 3 48679 48831 ( 153 n); cDNA 133 279 ( 147 n); score: 0.765
Intron 3 48832 48920 ( 89 n); Pd: 0.988 (s: 0.64), Pa: 0.739 (s: 0.76)
Exon 4 48921 49007 ( 87 n); cDNA 280 367 ( 88 n); score: 0.713
Intron 4 49008 49201 ( 194 n); Pd: 0.601 (s: 0.63), Pa: 0.941 (s: 0)
Exon 5 49202 49238 ( 37 n); cDNA 368 403 ( 36 n); score: 0.676
MATCH AC137075+ 14571346+ 0.777 403 0.995 C
PGS_AC137075+_14571346+ (48376 48383,48490 48607,48679 48831,48921 49007,49202 49238)
Alignment (genomic DNA sequence = upper lines):
ATCGTGAGGT TCGTTTAGCA AATATACTGA ATTTCGTAGC AAAGTATTTT CTATCATTGC 48435
|| || ||
ATTGTCAG.. .......... .......... .......... .......... .......... 14
ACCAGAGCTC TCTATGTCCA TTGACCTTAA CTTCATTCTG TTTATTCAAA GCAGCTTTAC 48495
|| ||
.......... .......... .......... .......... .......... ....GTTCAC 20
ACATAAAAAT CTTCTGAGAA TATACCCAAA GGGCACTCGC TTCAATTCTT CGAACTATAA 48555
||| | ||| || ||||| | |||||||||| ||||||||| ||| || | | ||||||||
ACAGAGAAAC CTACTGAGGA TATACCCAAA GGGCACTCGG GTCACATCAT CCAACTATAA 80
TCCGTTTCTT GGTTGGGTGC ATGGTGCACA AATGGTGGCA TTTAATATGC AGGTACATTT 48615
||| |||||| |||||||||| ||||||| || ||||| || || ||||||| ||
TCCATTTCTT GGTTGGGTGC ATGGTGCTCA GATGGTAGCG TTCAATATGC AG........ 132
CTAACATGAC ACTCCTCTGC TACATCATAT TGGCCTGAAT GCCTGATACA TTTTTCTTCG 48675
.......... .......... .......... .......... .......... .......... 132
CAGGGGTATG GAAGATCTCT TTGGCTAATG CACGGATTCT ACAAGGCCAA CGGTGGCTGC 48735
|| |||| ||||| |||| |||| | ||| || ||||| | | || || || ||||||||
...GGATATG GAAGAGCTCT TTGGTTGATG CATGGATTTT ATAAAGCTAA TGGTGGCTGT 189
GGTTATGTGA AGAAGCCAGA TTTCATGATG CAAACTTGTC CAGATGGAAA TGTTTTTGAC 48795
|||||||||| |||| ||||| |||| | ||| |||||| || |||| | |||||||||
GGTTATGTGA AGAAACCAGA TTTCTTAATG CAAACTGATC CAGA--G--- -GTTTTTGAC 243
CCGAAAGCAG ATTTACCTGT GAAGAAAACA CTCAAGGTAG GTTTGTGGCA TATGTTTCTT 48855
|| ||| | || || |||||||| | |||
CCAAAAAAAT CCCTATCTTC CAAGAAAACC TTGAAG.... .......... .......... 279
CCTTTCATTT TCATCTCTGA AATTCAGGAA TCGAGCTACT TACAGCTTGC CTGTTTGTCT 48915
.......... .......... .......... .......... .......... .......... 279
ACCAGGTCAA AGTATACATG GGCGAAGGTT GGCAGAGCGA CTTCAAGC-A GACATACTTC 48974
|| || |||||||||| || || |||| ||| | || ||||| || | ||| ||||
.....GTGAA AGTATACATG GGGGATGGTT GGCGGGTGGA CTTCACGCAA GACCCACTTT 334
GACACGTATT CCCCTCCAGA CTTCTACGCA AAGGTACATC GAATTTTACG CTGATGCCAA 49034
|| |||| | || || | ||| || ||| |
GATCAATATT CTTCTTCAAA CTTTTATGCA CGG....... .......... .......... 367
ACGCCAACAA ATTTGCAAAT GCAAAACGGA GCTTTGAAAA AACATGTATA TATGTATAAC 49094
.......... .......... .......... .......... .......... .......... 367
TTTTACATAT GGAGTGAGAT GAAGACAAAC TTTATATCAA AATTGTAGAG CTCCATGAGT 49154
.......... .......... .......... .......... .......... .......... 367
TCTACGACGT TCTTATTGAC TAGTCCATCG TTCCATCATC ATAACAGGTG GGCATTGCCG 49214
||| || || || |
.......... .......... .......... .......... .......GTG GGGATAGCGG 380
GGGTTCCGTC GGACTCGGTG ATGC 49238
| || || || || | || | ||
GAGTACC-TC GGCCGCGACC ACGC 403
hqPGS_AC137075+_14571346+ (48376 48383,48490 48607,48679 48831,48921 49007,49202 49238)
<A NAME="PGS9@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 9 -strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=29656473" TARGET="NUCLEOTIDE SEQUENCE SEARCH">29656473-</A>)
1 CATCATCCAA CTATAATCCA TTTCTTGGTT GGGTGCATGG TGCTCAGATG GTAGCGTTCA
61 ATATGCAGGG ATATGGAAGA GCTCTTTGGT TGATGCATGG ATTTTATAAA GCTAATGGTG
121 GCTGTGGTTA TGTGAAGAAA CCAGATTTCT TAATGCAAAC TGATCCAGAG GTTTTTGACC
181 CAAAAAAATC CCTATCTCCC AAGAAAACCT TGAAGGTGAA AGTATACATG GGGGATGGTT
241 GGCGGATGGA CTTCACGCAG ACCCACTTTG ATCAATATTC TCCTCCAGAC TTTTATGCAC
301 GGGTGGGGAT AGCGGGAGTA CCAGCGGACT CGGTGATGAA GAGAACGAGG GCGATAGAGG
361 ATAACTGGGT GCCGGTGTGG GAGGAGGATT TCACCTTCAA ACTGACCGTG CCGGAGATCG
421 CGTTGCTGCG GGTGGAGGTG CACGAGTACG ACATGTCGGA GAAGGACGAC TTCGGCGGCC
481 AGACGGTGCT GCCGGTGTCG GATCTCATCC CGGGGATCCG AGCGGTGGCA CTCCACGACC
541 GCAAAGGGAT CAAGTTGAAC AACGTCAAGC TTTTCATGCG CTTCGAGTTT GAATGACCCA
601 ACACACCGAC ACTTTCTTTC TTTCTCGCCG CATCGCATTG CACTGTGCCT GTGCTTGTGC
661 AGCATCCATC ATTTGGTTTG GTTTTTCATG TTCCTGTGCA TACGCATTTG TGTCTGTACA
721 TAGGCTCGGT CCTGTATATT GTTTGTGAGG GGCATGTAAT AATAAGGCTT CACGCCATGT
781 TCAAATAAAA AAG
Predicted gene structure (within gDNA segment 48155 to 51000):
Exon 1 48542 48607 ( 66 n); cDNA 3 68 ( 66 n); score: 0.879
Intron 1 48608 48678 ( 71 n); Pd: 0.976 (s: 0.88), Pa: 0.976 (s: 0.80)
Exon 2 48679 48831 ( 153 n); cDNA 69 215 ( 147 n); score: 0.765
Intron 2 48832 48920 ( 89 n); Pd: 0.988 (s: 0.64), Pa: 0.739 (s: 0.82)
Exon 3 48921 49007 ( 87 n); cDNA 216 302 ( 87 n); score: 0.782
Intron 3 49008 49201 ( 194 n); Pd: 0.601 (s: 0.74), Pa: 0.941 (s: 0.76)
Exon 4 49202 49500 ( 299 n); cDNA 303 605 ( 303 n); score: 0.759
MATCH AC137075+ 29656473- 0.777 605 0.763 C
PGS_AC137075+_29656473- (48542 48607,48679 48831,48921 49007,49202 49500)
Alignment (genomic DNA sequence = upper lines):
TCTTCGAACT ATAATCCGTT TCTTGGTTGG GTGCATGGTG CACAAATGGT GGCATTTAAT 48601
|| || |||| ||||||| || |||||||||| |||||||||| | || ||||| || || |||
TCATCCAACT ATAATCCATT TCTTGGTTGG GTGCATGGTG CTCAGATGGT AGCGTTCAAT 62
ATGCAGGTAC ATTTCTAACA TGACACTCCT CTGCTACATC ATATTGGCCT GAATGCCTGA 48661
||||||
ATGCAG.... .......... .......... .......... .......... .......... 68
TACATTTTTC TTCGCAGGGG TATGGAAGAT CTCTTTGGCT AATGCACGGA TTCTACAAGG 48721
|| ||||||||| |||||||| | ||||| ||| || || || |
.......... .......GGA TATGGAAGAG CTCTTTGGTT GATGCATGGA TTTTATAAAG 111
CCAACGGTGG CTGCGGTTAT GTGAAGAAGC CAGATTTCAT GATGCAAACT TGTCCAGATG 48781
| || ||||| ||| |||||| |||||||| | |||||||| | ||||||||| ||||||
CTAATGGTGG CTGTGGTTAT GTGAAGAAAC CAGATTTCTT AATGCAAACT GATCCAGA-- 169
GAAATGTTTT TGACCCGAAA GCAGATTTAC CTGTGAAGAA AACACTCAAG GTAGGTTTGT 48841
| ||||| |||||| ||| | || || ||||| ||| | |||
G----GTTTT TGACCCAAAA AAATCCCTAT CTCCCAAGAA AACCTTGAAG .......... 215
GGCATATGTT TCTTCCTTTC ATTTTCATCT CTGAAATTCA GGAATCGAGC TACTTACAGC 48901
.......... .......... .......... .......... .......... .......... 215
TTGCCTGTTT GTCTACCAGG TCAAAGTATA CATGGGCGAA GGTTGGCAGA GCGACTTCAA 48961
| | |||||||| |||||| || ||||||| || |||||||
.......... .........G TGAAAGTATA CATGGGGGAT GGTTGGCGGA TGGACTTCAC 256
GCAGACATAC TTCGACACGT ATTCCCCTCC AGACTTCTAC GCAAAGGTAC ATCGAATTTT 49021
|||||| || || || | |||| ||||| |||||| || ||| |
GCAGACCCAC TTTGATCAAT ATTCTCCTCC AGACTTTTAT GCACGG.... .......... 302
ACGCTGATGC CAAACGCCAA CAAATTTGCA AATGCAAAAC GGAGCTTTGA AAAAACATGT 49081
.......... .......... .......... .......... .......... .......... 302
ATATATGTAT AACTTTTACA TATGGAGTGA GATGAAGACA AACTTTATAT CAAAATTGTA 49141
.......... .......... .......... .......... .......... .......... 302
GAGCTCCATG AGTTCTACGA CGTTCTTATT GACTAGTCCA TCGTTCCATC ATCATAACAG 49201
.......... .......... .......... .......... .......... .......... 302
GTGGGCATTG CCGGGGTTCC GTCGGACTCG GTGATGCAGA AGACGAAAGC CGTGGAGGAC 49261
||||| || | | || || || |||||||| |||||| ||| |||| || | |||||
GTGGGGATAG CGGGAGTACC AGCGGACTCG GTGATGAAGA GAACGAGGGC GATAGAGGAT 362
AGCTGGGTTC CCGTGTGGGA GGAGGAGTTC GTGTTCCCGC TGACCGTCCC GGAGATCGCG 49321
| |||||| | | |||||||| |||||| ||| ||| | ||||||| || ||||||||||
AACTGGGTGC CGGTGTGGGA GGAGGATTTC ACCTTCAAAC TGACCGTGCC GGAGATCGCG 422
CTGCTCCGCG TGGAGGTGCA CGAGTACGAC --GTGAGCG- AGGACGACTT CGGCGGGCAG 49378
|||| || | |||||||||| |||||||||| || | | |||||||||| |||||| |||
TTGCTGCGGG TGGAGGTGCA CGAGTACGAC ATGTCGGAGA AGGACGACTT CGGCGGCCAG 482
ACGGCGCTCC CGGTGTCGGA GCTGCGGCCG GGGATCCGCA CCGTGCCGCT CTTCGACCAC 49438
|||| ||| | |||||||||| || ||| |||||||| | ||| | || | ||||| |
ACGGTGCTGC CGGTGTCGGA TCTCATCCCG GGGATCCGAG CGGTGGCACT CCACGACCGC 542
AAGGGGCTCA AGTTCAAGAG CGTCAAGCTC CTCATGCGGT TCGAGTTCGT CT-AGCAAAT 49497
|| ||| ||| |||| || | ||||||||| ||||||| | ||||||| | | | | ||
AAAGGGATCA AGTTGAACAA CGTCAAGCTT TTCATGCGCT TCGAGTTTGA ATGACCCAAC 602
TCA 49500
||
ACA 605
hqPGS_AC137075+_29656473- (48542 48607,48679 48831,48921 49007,49202 49500)
<A NAME="PGS10@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 10 -strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=29663931" TARGET="NUCLEOTIDE SEQUENCE SEARCH">29663931-</A>)
1 CAGATGGTAG CGTTCAATAT GCAGGGATAT GGAAGAGCTC TTTGGTTGAT GCATGGATTT
61 TATAAAGCTA ATGGTGGCTG TGGTTATGTG AAGAAACCAG ATTTCTTAAT GCAAACTGAT
121 CCAGAGGTTT TTGACCCAAA AAAATCCCTA TCTCCCAAGA AAACCTTGAA GGTGAAAGTA
181 TACATGGGGG ATGGTTGGCG GATGGACTTC ACGCAGACCC ACTTTGATCA ATATTCTCCT
241 CCAGACTTTT ATGCACGGGT GGGGATAGCG GGAGTACCAG CGGACTCGGT GATGAAGAGA
301 ACGAGGGCGA TAGAGGATAA CTGGGTGCCG GTGTGGGAGG AGGATTTCAC CTTCAAACTG
361 ACCGTGCCGG AGATCGCGTT GCTGCGGGTG GAGGTGCACG AGTACGACAT GTCGGAGAAG
421 GACGACTTCG GCGGCCAGAC GGTGCTGCCG GTGTCGGATC TCATCCCGGG GATCCGAGCG
481 GTGGCACTCC ACGACCGCAA AGGGATCAAG TTGAACAACG TCAAGCTTTT CATGCGCTTC
541 GAGTTTGAAT GACCCAACAC ACCGACACTT TCTTTCTTTC TCGCCGCATC GCATTGCACT
601 GTGCCTGTGC TTGTGCAGCA TCCATCATTT GGTTTGGTTT TTCATGTTCC TGTGCATACG
661 CATTTGTGTC TGTACATAGG CTCGGTCCTG TATATTGTTT GTGAGTAACA TGTGATAATA
721 AGGCTTCACG CCATGTTCAT TCC
Predicted gene structure (within gDNA segment 48026 to 51000):
Exon 1 48584 48607 ( 24 n); cDNA 1 24 ( 24 n); score: 0.833
Intron 1 48608 48678 ( 71 n); Pd: 0.976 (s: 0), Pa: 0.976 (s: 0.80)
Exon 2 48679 48831 ( 153 n); cDNA 25 171 ( 147 n); score: 0.765
Intron 2 48832 48920 ( 89 n); Pd: 0.988 (s: 0.64), Pa: 0.739 (s: 0.82)
Exon 3 48921 49007 ( 87 n); cDNA 172 258 ( 87 n); score: 0.782
Intron 3 49008 49201 ( 194 n); Pd: 0.601 (s: 0.74), Pa: 0.941 (s: 0.76)
Exon 4 49202 49500 ( 299 n); cDNA 259 561 ( 303 n); score: 0.759
MATCH AC137075+ 29663931- 0.764 563 0.758 C
PGS_AC137075+_29663931- (48584 48607,48679 48831,48921 49007,49202 49500)
Alignment (genomic DNA sequence = upper lines):
CAAATGGTGG CATTTAATAT GCAGGTACAT TTCTAACATG ACACTCCTCT GCTACATCAT 48643
|| ||||| | | || ||||| ||||
CAGATGGTAG CGTTCAATAT GCAG...... .......... .......... .......... 24
ATTGGCCTGA ATGCCTGATA CATTTTTCTT CGCAGGGGTA TGGAAGATCT CTTTGGCTAA 48703
|| || ||||||| || |||||| | |
.......... .......... .......... .....GGATA TGGAAGAGCT CTTTGGTTGA 49
TGCACGGATT CTACAAGGCC AACGGTGGCT GCGGTTATGT GAAGAAGCCA GATTTCATGA 48763
|||| ||||| || || || || ||||||| | |||||||| |||||| ||| |||||| | |
TGCATGGATT TTATAAAGCT AATGGTGGCT GTGGTTATGT GAAGAAACCA GATTTCTTAA 109
TGCAAACTTG TCCAGATGGA AATGTTTTTG ACCCGAAAGC AGATTTACCT GTGAAGAAAA 48823
|||||||| |||||| | ||||||| |||| ||| | || || |||||||
TGCAAACTGA TCCAGA--G- ---GTTTTTG ACCCAAAAAA ATCCCTATCT CCCAAGAAAA 163
CACTCAAGGT AGGTTTGTGG CATATGTTTC TTCCTTTCAT TTTCATCTCT GAAATTCAGG 48883
| | |||
CCTTGAAG.. .......... .......... .......... .......... .......... 171
AATCGAGCTA CTTACAGCTT GCCTGTTTGT CTACCAGGTC AAAGTATACA TGGGCGAAGG 48943
|| |||||||||| |||| || ||
.......... .......... .......... .......GTG AAAGTATACA TGGGGGATGG 194
TTGGCAGAGC GACTTCAAGC AGACATACTT CGACACGTAT TCCCCTCCAG ACTTCTACGC 49003
||||| || ||||||| || |||| |||| || ||| || ||||||| |||| || ||
TTGGCGGATG GACTTCACGC AGACCCACTT TGATCAATAT TCTCCTCCAG ACTTTTATGC 254
AAAGGTACAT CGAATTTTAC GCTGATGCCA AACGCCAACA AATTTGCAAA TGCAAAACGG 49063
| |
ACGG...... .......... .......... .......... .......... .......... 258
AGCTTTGAAA AAACATGTAT ATATGTATAA CTTTTACATA TGGAGTGAGA TGAAGACAAA 49123
.......... .......... .......... .......... .......... .......... 258
CTTTATATCA AAATTGTAGA GCTCCATGAG TTCTACGACG TTCTTATTGA CTAGTCCATC 49183
.......... .......... .......... .......... .......... .......... 258
GTTCCATCAT CATAACAGGT GGGCATTGCC GGGGTTCCGT CGGACTCGGT GATGCAGAAG 49243
|| ||| || || || || || |||||||||| |||| |||
.......... ........GT GGGGATAGCG GGAGTACCAG CGGACTCGGT GATGAAGAGA 300
ACGAAAGCCG TGGAGGACAG CTGGGTTCCC GTGTGGGAGG AGGAGTTCGT GTTCCCGCTG 49303
|||| || | ||||| | |||||| || |||||||||| |||| ||| ||| |||
ACGAGGGCGA TAGAGGATAA CTGGGTGCCG GTGTGGGAGG AGGATTTCAC CTTCAAACTG 360
ACCGTCCCGG AGATCGCGCT GCTCCGCGTG GAGGTGCACG AGTACGAC-- GTGAGCG-AG 49360
||||| |||| |||||||| | ||| || ||| |||||||||| |||||||| || | | ||
ACCGTGCCGG AGATCGCGTT GCTGCGGGTG GAGGTGCACG AGTACGACAT GTCGGAGAAG 420
GACGACTTCG GCGGGCAGAC GGCGCTCCCG GTGTCGGAGC TGCGGCCGGG GATCCGCACC 49420
|||||||||| |||| ||||| || ||| ||| |||||||| | | ||||| |||||| |
GACGACTTCG GCGGCCAGAC GGTGCTGCCG GTGTCGGATC TCATCCCGGG GATCCGAGCG 480
GTGCCGCTCT TCGACCACAA GGGGCTCAAG TTCAAGAGCG TCAAGCTCCT CATGCGGTTC 49480
||| | ||| ||||| ||| ||| ||||| || || | || ||||||| | |||||| |||
GTGGCACTCC ACGACCGCAA AGGGATCAAG TTGAACAACG TCAAGCTTTT CATGCGCTTC 540
GAGTTCGTCT -AGCAAATTC A 49500
||||| | | | | || | |
GAGTTTGAAT GACCCAACAC A 561
hqPGS_AC137075+_29663931- (48584 48607,48679 48831,48921 49007,49202 49500)
<A NAME="PGS5@AC137075-45000-50999"></A>
********************************************************************************
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll down to "Predicted gene locations"</A>
EST sequence 5 +strand (File: <A HREF="http://www.plantgdb.org/data.php?Seq_ID=2797925" TARGET="NUCLEOTIDE SEQUENCE SEARCH">2797925+</A>)
1 GCCTTTCAGG TGAAAGTATA CATGGGGGAT GGTTGGCGGA TGGACTTCAC GCAGACCCAC
61 TTTGATCAAT TAACTCCTCC AGACTTTTAT GCACGGGTGG GGATAGCGGG AGTACCAGCG
121 ACTCGGTGAT GAAGAGAACG AGGGCGATAG AGGATAACTG GGTTGCCGGT GTGGGAGGAG
181 GATTTCACCT TCAAACTGAC CGTTNCGAGA TCTGCGTTGC TGCGGGTTGG A
Predicted gene structure (within gDNA segment 48564 to 49837):
Exon 1 48914 49007 ( 94 n); cDNA 3 96 ( 94 n); score: 0.745
Intron 1 49008 49201 ( 194 n); Pd: 0.601 (s: 0.68), Pa: 0.941 (s: 0.74)
Exon 2 49202 49329 ( 128 n); cDNA 97 224 ( 128 n); score: 0.727
MATCH AC137075+ 2797925+ 0.734 222 0.961 C
PGS_AC137075+_2797925+ (48914 49007,49202 49329)
Alignment (genomic DNA sequence = upper lines):
CTACCAGGTC AAAGTATACA TGGGCGAAGG TTGGCAGAGC GACTTCAAGC AGACATACTT 48973
|| ||||| |||||||||| |||| || || ||||| || ||||||| || |||| ||||
CTTTCAGGTG AAAGTATACA TGGGGGATGG TTGGCGGATG GACTTCACGC AGACCCACTT 62
CGACACGTAT TCCCCTCCAG ACTTCTACGC AAAGGTACAT CGAATTTTAC GCTGATGCCA 49033
|| | | ||||||| |||| || || | |
TGATCAATTA ACTCCTCCAG ACTTTTATGC ACGG...... .......... .......... 96
AACGCCAACA AATTTGCAAA TGCAAAACGG AGCTTTGAAA AAACATGTAT ATATGTATAA 49093
.......... .......... .......... .......... .......... .......... 96
CTTTTACATA TGGAGTGAGA TGAAGACAAA CTTTATATCA AAATTGTAGA GCTCCATGAG 49153
.......... .......... .......... .......... .......... .......... 96
TTCTACGACG TTCTTATTGA CTAGTCCATC GTTCCATCAT CATAACAGGT GGGCATTGCC 49213
|| ||| || ||
.......... .......... .......... .......... ........GT GGGGATAGCG 108
GGGGTTCCGT CGGACTCGGT GATGCAGAAG ACGAAAGCCG TGGAGGACAG CTGGGTT-CC 49272
|| || || | |||||||| |||| ||| |||| || | ||||| | ||||||| ||
GGAGTACCAG C-GACTCGGT GATGAAGAGA ACGAGGGCGA TAGAGGATAA CTGGGTTGCC 167
CGTGTGGGAG GAGGAGTTCG TGTTCCCGCT GACCGTCCCG GAGATC-GCG CTGCTCCG 49329
||||||||| ||||| ||| ||| || |||||| | |||||| ||| |||| ||
GGTGTGGGAG GAGGATTTCA CCTTCAAACT GACCGTTNC- GAGATCTGCG TTGCTGCG 224
hqPGS_AC137075+_2797925+ (48914 49007,49202 49329)
<A NAME="HEAD-PGL-AC137075-45000-50999"></A>
Total number of EST alignments reported: 10
________________________________________________________________________________
<A NAME="PGL1@AC137075-45000-50999"></A>
Predicted gene locations (1) in segment 45001 to 51000:
Scroll up to <A HREF="#TOP">top</A>
Scroll down to PGL <A HREF="#PGL1@AC137075-45000-50999">1</A>
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll back to "Predicted gene locations"</A>
<A HREF="#BOTTOM-PGL-AC137075-45000-50999">Scroll down to next segment</A>
PGL 1 (+ strand): 47302 49500
<A NAME="PGL1_AGS0@AC137075-45000-50999"></A>
AGS-1 (47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49500)
SCR (e 0.750 d 0.981 a 0.943,e 0.721 d 0.967 a 0.462,e 0.651 d 0.950 a 0.864,e 0.749 d 0.981 a 0.477,e 0.839 d 0.976 a 0.976,e 0.765 d 0.988 a 0.739,e 0.782 d 0.601 a 0.941,e 0.766)
Exon 1 47302 47501 ( 200 n); score: 0.750
Intron 1 47502 47600 ( 99 n); Pd: 0.981 Pa: 0.943
Exon 2 47601 47736 ( 136 n); score: 0.721
Intron 2 47737 47813 ( 77 n); Pd: 0.967 Pa: 0.462
Exon 3 47814 48074 ( 261 n); score: 0.651
Intron 3 48075 48150 ( 76 n); Pd: 0.950 Pa: 0.864
Exon 4 48151 48383 ( 233 n); score: 0.749
Intron 4 48384 48489 ( 106 n); Pd: 0.981 Pa: 0.477
Exon 5 48490 48607 ( 118 n); score: 0.839
Intron 5 48608 48678 ( 71 n); Pd: 0.976 Pa: 0.976
Exon 6 48679 48831 ( 153 n); score: 0.765
Intron 6 48832 48920 ( 89 n); Pd: 0.988 Pa: 0.739
Exon 7 48921 49007 ( 87 n); score: 0.782
Intron 7 49008 49201 ( 194 n); Pd: 0.601 Pa: 0.941
Exon 8 49202 49500 ( 299 n); score: 0.766
<A HREF="#PGS8@AC137075-45000-50999">PGS</A> (47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49500) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=32980476" TARGET="NUCLEOTIDE SEQUENCE SEARCH">32980476+</A>
<A HREF="#PGS7@AC137075-45000-50999">PGS</A> (47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49500) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=12698877" TARGET="NUCLEOTIDE SEQUENCE SEARCH">12698877+</A>
<A HREF="#PGS6@AC137075-45000-50999">PGS</A> (47302 47501,47601 47736,47814 47955) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=29680389" TARGET="NUCLEOTIDE SEQUENCE SEARCH">29680389+</A>
<A HREF="#PGS2@AC137075-45000-50999">PGS</A> (48157 48383,48490 48607,48679 48831,48921 49007,49202 49227) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=27578307" TARGET="NUCLEOTIDE SEQUENCE SEARCH">27578307+</A>
<A HREF="#PGS3@AC137075-45000-50999">PGS</A> (48173 48383,48490 48607,48679 48831,48921 49007,49202 49500) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=32971490" TARGET="NUCLEOTIDE SEQUENCE SEARCH">32971490+</A>
<A HREF="#PGS4@AC137075-45000-50999">PGS</A> (48238 48383,48490 48607,48679 48831,48921 49007,49202 49278) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=33382000" TARGET="NUCLEOTIDE SEQUENCE SEARCH">33382000+</A>
<A HREF="#PGS1@AC137075-45000-50999">PGS</A> (48376 48383,48490 48607,48679 48831,48921 49007,49202 49238) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=14571346" TARGET="NUCLEOTIDE SEQUENCE SEARCH">14571346+</A>
<A HREF="#PGS9@AC137075-45000-50999">PGS</A> (48542 48607,48679 48831,48921 49007,49202 49500) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=29656473" TARGET="NUCLEOTIDE SEQUENCE SEARCH">29656473-</A>
<A HREF="#PGS10@AC137075-45000-50999">PGS</A> (48584 48607,48679 48831,48921 49007,49202 49500) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=29663931" TARGET="NUCLEOTIDE SEQUENCE SEARCH">29663931-</A>
<A HREF="#PGS5@AC137075-45000-50999">PGS</A> (48914 49007,49202 49329) <A HREF="http://www.plantgdb.org/data.php?Seq_ID=2797925" TARGET="NUCLEOTIDE SEQUENCE SEARCH">2797925+</A>
3-phase translation of AGS-1 (+strand):
. . . . . .
47302 CAGGTCCATCACGACATGAATGCACCATTATCGCACTACTTCATATACACTGGACACAAC
Q V H H D M N A P L S H Y F I Y T G H N
R S I T T - M H H Y R T T S Y T L D T T
G P S R H E C T I I A L L H I H W T Q
. . . . . .
47362 TCGTATCTGACGGGCAATCAACTTAGCAGTGACTGCAGTGATATTCCCATCATTAAGGCA
S Y L T G N Q L S S D C S D I P I I K A
R I - R A I N L A V T A V I F P S L R H
L V S D G Q S T - Q - L Q - Y S H H - G
. . . . . .
47422 CTGCAAATAGGCGTCCGTGTAATTGAACTGGACATGTGGCCAAATTCTTCTAAAGATGAT
L Q I G V R V I E L D M W P N S S K D D
C K - A S V - L N W T C G Q I L L K M M
T A N R R P C N - T G H V A K F F - R -
. . : . . . .
47482 GTTGATATTCTCCATGGAAG : GACACTGACTGCCCCAGTATCACTTATCAAATGCTTGAAA
V D I L H G R : T L T A P V S L I K C L K
L I F S M E : G H - L P Q Y H L S N A - N
C - Y S P W K : D T D C P S I T Y Q M L E
. . . . . .
47641 TCCATCAAAGAATATGCCTTTGTTGCGTCTCCCTACCCTGTTATTATAACATTAGAAGAC
S I K E Y A F V A S P Y P V I I T L E D
P S K N M P L L R L P T L L L - H - K T
I H Q R I C L C C V S L P C Y Y N I R R
. . . . : . .
47701 CACCTTACATCTGATCTTCAGGCGAAAGTAGCTAAG : ATGGTTCTTGAAGTATTTGGAGAT
H L T S D L Q A K V A K : M V L E V F G D
T L H L I F R R K - L R : W F L K Y L E I
P P Y I - S S G E S S - : D G S - S I W R
. . . . . .
47838 ACCCTATATTATCCCGAGTCAAAACATCTTCAAGAATTTCCTTCACCCGAAGCACTGAGG
T L Y Y P E S K H L Q E F P S P E A L R
P Y I I P S Q N I F K N F L H P K H - G
Y P I L S R V K T S S R I S F T R S T E
. . . . . .
47898 GGACGTGTCATCCTCTCAACAAAACCCCCAAAGGAGTACCTTGAATCAAAAGGTGGTACT
G R V I L S T K P P K E Y L E S K G G T
D V S S S Q Q N P Q R S T L N Q K V V L
G T C H P L N K T P K G V P - I K R W Y
. . . . . .
47958 ATGAAAGACAGAGACATTGAGCCTCAGTTTAGCAAAGGACAAAATGAAGAAGCTGTCTGG
M K D R D I E P Q F S K G Q N E E A V W
- K T E T L S L S L A K D K M K K L S G
Y E R Q R H - A S V - Q R T K - R S C L
. . . . . . :
48018 GGAACAGAAGTCCCAGATATTCAGGATGAGATGCAAACCGCCGACAAGGTTCTACTG : CAG
G T E V P D I Q D E M Q T A D K V L L : Q
E Q K S Q I F R M R C K P P T R F Y C : S
G N R S P R Y S G - D A N R R Q G S T : A
. . . . . .
48154 CATGAGAATGATATACTATACACCCAAAGAGATGTGGAAGAAGATGATGAGAAGAAAATG
H E N D I L Y T Q R D V E E D D E K K M
M R M I Y Y T P K E M W K K M M R R K C
A - E - Y T I H P K R C G R R - - E E N
. . . . . .
48214 TGCCAGCATCACCCACTAGAGTATAAACACCTTATTACTATTAAGGCAGGAAAGCCAAAG
C Q H H P L E Y K H L I T I K A G K P K
A S I T H - S I N T L L L L R Q E S Q R
V P A S P T R V - T P Y Y Y - G R K A K
. . . . . .
48274 GGTGCTGTAGTTGATGCCTTAAAGGGTGATCCAGATAAAGTTAGACGCCTCAGTTTGAGT
G A V V D A L K G D P D K V R R L S L S
V L - L M P - R V I Q I K L D A S V - V
G C C S - C L K G - S R - S - T P Q F E
. . . . . : .
48334 GAGCAGGAACTTGCAAAAGTGGCAGCGCATCATGGTCGTAACATCGTGAG : CTTTACACAT
E Q E L A K V A A H H G R N I V S : F T H
S R N L Q K W Q R I M V V T S - : A L H I
- A G T C K S G S A S W S - H R E : L Y T
. . . . . .
48500 AAAAATCTTCTGAGAATATACCCAAAGGGCACTCGCTTCAATTCTTCGAACTATAATCCG
K N L L R I Y P K G T R F N S S N Y N P
K I F - E Y T Q R A L A S I L R T I I R
- K S S E N I P K G H S L Q F F E L - S
. . . . . : .
48560 TTTCTTGGTTGGGTGCATGGTGCACAAATGGTGGCATTTAATATGCAG : GGGTATGGAAGA
F L G W V H G A Q M V A F N M Q : G Y G R
F L V G C M V H K W W H L I C R : G M E D
V S W L G A W C T N G G I - Y A : G V W K
. . . . . .
48691 TCTCTTTGGCTAATGCACGGATTCTACAAGGCCAACGGTGGCTGCGGTTATGTGAAGAAG
S L W L M H G F Y K A N G G C G Y V K K
L F G - C T D S T R P T V A A V M - R S
I S L A N A R I L Q G Q R W L R L C E E
. . . . . .
48751 CCAGATTTCATGATGCAAACTTGTCCAGATGGAAATGTTTTTGACCCGAAAGCAGATTTA
P D F M M Q T C P D G N V F D P K A D L
Q I S - C K L V Q M E M F L T R K Q I Y
A R F H D A N L S R W K C F - P E S R F
. . . : . . .
48811 CCTGTGAAGAAAACACTCAAG : GTCAAAGTATACATGGGCGAAGGTTGGCAGAGCGACTTC
P V K K T L K : V K V Y M G E G W Q S D F
L - R K H S R : S K Y T W A K V G R A T S
T C E E N T Q : G Q S I H G R R L A E R L
. . . . . : .
48960 AAGCAGACATACTTCGACACGTATTCCCCTCCAGACTTCTACGCAAAG : GTGGGCATTGCC
K Q T Y F D T Y S P P D F Y A K : V G I A
S R H T S T R I P L Q T S T Q R : W A L P
Q A D I L R H V F P S R L L R K : G G H C
. . . . . .
49214 GGGGTTCCGTCGGACTCGGTGATGCAGAAGACGAAAGCCGTGGAGGACAGCTGGGTTCCC
G V P S D S V M Q K T K A V E D S W V P
G F R R T R - C R R R K P W R T A G F P
R G S V G L G D A E D E S R G G Q L G S
. . . . . .
49274 GTGTGGGAGGAGGAGTTCGTGTTCCCGCTGACCGTCCCGGAGATCGCGCTGCTCCGCGTG
V W E E E F V F P L T V P E I A L L R V
C G R R S S C S R - P S R R S R C S A W
R V G G G V R V P A D R P G D R A A P R
. . . . . .
49334 GAGGTGCACGAGTACGACGTGAGCGAGGACGACTTCGGCGGGCAGACGGCGCTCCCGGTG
E V H E Y D V S E D D F G G Q T A L P V
R C T S T T - A R T T S A G R R R S R C
G G A R V R R E R G R L R R A D G A P G
. . . . . .
49394 TCGGAGCTGCGGCCGGGGATCCGCACCGTGCCGCTCTTCGACCACAAGGGGCTCAAGTTC
S E L R P G I R T V P L F D H K G L K F
R S C G R G S A P C R S S T T R G S S S
V G A A A G D P H R A A L R P Q G A Q V
. . . . .
49454 AAGAGCGTCAAGCTCCTCATGCGGTTCGAGTTCGTCTAGCAAATTCA
K S V K L L M R F E F V - Q I
R A S S S S C G S S S S S K F
Q E R Q A P H A V R V R L A N S
Maximal non-overlapping open reading frames (>= 64 codons):
<A NAME="PGL1_ORF1@AC137075-45000-50999"></A>
>AC137075+_PGL-1_AGS-1_PPS_1 (47302 47501,47601 47736,47814 48074,48151 48383,48490 48607,48679 48831,48921 49007,49202 49492) (frame '1'; 1476 bp, 492 residues)
1 QVHHDMNAPL SHYFIYTGHN SYLTGNQLSS DCSDIPIIKA LQIGVRVIEL DMWPNSSKDD
61 VDILHGRTLT APVSLIKCLK SIKEYAFVAS PYPVIITLED HLTSDLQAKV AKMVLEVFGD
121 TLYYPESKHL QEFPSPEALR GRVILSTKPP KEYLESKGGT MKDRDIEPQF SKGQNEEAVW
181 GTEVPDIQDE MQTADKVLLQ HENDILYTQR DVEEDDEKKM CQHHPLEYKH LITIKAGKPK
241 GAVVDALKGD PDKVRRLSLS EQELAKVAAH HGRNIVSFTH KNLLRIYPKG TRFNSSNYNP
301 FLGWVHGAQM VAFNMQGYGR SLWLMHGFYK ANGGCGYVKK PDFMMQTCPD GNVFDPKADL
361 PVKKTLKVKV YMGEGWQSDF KQTYFDTYSP PDFYAKVGIA GVPSDSVMQK TKAVEDSWVP
421 VWEEEFVFPL TVPEIALLRV EVHEYDVSED DFGGQTALPV SELRPGIRTV PLFDHKGLKF
481 KSVKLLMRFE FV-
<A HREF="http://www.ncbi.nlm.nih.gov/blast/Blast.cgi?CMD=Web&LAYOUT=TwoWindows&AUTO_FORMAT=Semiauto&ALIGNMENTS=50&ALIGNMENT_VIEW=Pairwise&CDD_SEARCH=on&CLIENT=web&COMPOSITION_BASED_STATISTICS=on&DATABASE=nr&DESCRIPTIONS=100&ENTREZ_QUERY=%28none%29&EXPECT=10&FILTER=L&FORMAT_OBJECT=Alignment&FORMAT_TYPE=HTML&I_THRESH=0.005&MATRIX_NAME=BLOSUM62&NCBI_GI=on&PAGE=Proteins&PROGRAM=blastp&SERVICE=plain&SET_DEFAULTS.x=41&SET_DEFAULTS.y=5&SHOW_OVERVIEW=on&END_OF_HTTPGET=Yes&SHOW_LINKOUT=yes&GET_SEQUENCE=yes&QUERY=QVHHDMNAPLSHYFIYTGHNSYLTGNQLSSDCSDIPIIKALQIGVRVIELDMWPNSSKDDVDILHGRTLTAPVSLIKCLKSIKEYAFVASPYPVIITLEDHLTSDLQAKVAKMVLEVFGDTLYYPESKHLQEFPSPEALRGRVILSTKPPKEYLESKGGTMKDRDIEPQFSKGQNEEAVWGTEVPDIQDEMQTADKVLLQHENDILYTQRDVEEDDEKKMCQHHPLEYKHLITIKAGKPKGAVVDALKGDPDKVRRLSLSEQELAKVAAHHGRNIVSFTHKNLLRIYPKGTRFNSSNYNPFLGWVHGAQMVAFNMQGYGRSLWLMHGFYKANGGCGYVKKPDFMMQTCPDGNVFDPKADLPVKKTLKVKVYMGEGWQSDFKQTYFDTYSPPDFYAKVGIAGVPSDSVMQKTKAVEDSWVPVWEEEFVFPLTVPEIALLRVEVHEYDVSEDDFGGQTALPVSELRPGIRTVPLFDHKGLKFKSVKLLMRFEFV-&END_OF_HTTPGET=Yes" TARGET="NCBI Blastp">NCBI Blastp</A>
<A HREF="#HEAD-PGL-AC137075-45000-50999">Scroll back to "Predicted gene locations"</A>
<A HREF="#TOP">Scroll up to top</A>
<A NAME="BOTTOM-PGL-AC137075-45000-50999"></A>
<A NAME="PPGS2@AC137075-45000-50999"></A>
********************************************************************************
Query protein sequence 2 (File: <A HREF="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Search&db=Protein&term=AtPLC2&doptcmdl=GenPept" TARGET="PROTEIN SEQUENCE SEARCH">AtPLC2</A>)
1 MSKQTYKVCF CFRRRFRYTA SEAPREIKTI FEKYSENGVM TVDHLHRFLI DVQKQDKATR
61 EDAQSIINSA SSLLHRNGLH LDAFFKYLFG DNNPPLALHK VHHDMDAPIS HYFIFTGHNS
121 YLTGNQLSSD CSEVPIIDAL KKGVRVIELD IWPNSNKDDI DVLHGMTLTT PVGLIKCLKA
181 IRAHAFDVSD YPVVVTLEDH LTPDLQSKVA EMVTEIFGEI LFTPPVGESL KEFPSPNSLK
241 RRIIISTKPP KEYKEGKDVE VVQKGKDLGD EEVWGREVPS FIQRNKSEAK DDLDGNDDDD
301 DDDDEDKSKI NAPPQYKHLI AIHAGKPKGG ITECLKVDPD KVRRLSLSEE QLEKAAEKYA
361 KQIVRFTQHN LLRIYPKGTR VTSSNYNPLV GWSHGAQMVA FNMQGYGRSL WLMQGMFRAN
421 GGCGYIKKPD LLLKSGSDSD IFDPKATLPV KTTLRVTVYM GEGWYFDFRH THFDQYSPPD
481 FYTRVGIAGV PGDTVMKKTK TLEDNWIPAW DEVFEFPLTV PELALLRLEV HEYDMSEKDD
541 FGGQTCLPVW ELSEGIRAFP LHSRKGEKYK SVKLLVKVEF V-
Predicted gene structure (within gDNA segment 45001 to 51000):
Exon 1 46116 46439 ( 324 n); Protein 1 100 ( 100 aa); score: 0.234
Intron 1 46440 47304 ( 865 n); Pd: 0.852 Pa: 0.016
Exon 2 47305 47501 ( 197 n); Protein 101 166 ( 66 aa); score: 0.858
Intron 2 47502 47600 ( 99 n); Pd: 0.981 Pa: 0.943
Exon 3 47601 47736 ( 136 n); Protein 167 211 ( 45 aa); score: 0.716
Intron 3 47737 47813 ( 77 n); Pd: 0.967 Pa: 0.462
Exon 4 47814 48065 ( 252 n); Protein 212 288 ( 77 aa); score: 0.411
Intron 4 48066 48153 ( 88 n); Pd: 0.994 Pa: 0.957
Exon 5 48154 48383 ( 230 n); Protein 289 365 ( 77 aa); score: 0.458
Intron 5 48384 48489 ( 106 n); Pd: 0.981 Pa: 0.477
Exon 6 48490 48607 ( 118 n); Protein 366 404 ( 39 aa); score: 0.824
Intron 6 48608 48678 ( 71 n); Pd: 0.976 Pa: 0.976
Exon 7 48679 48831 ( 153 n); Protein 405 455 ( 51 aa); score: 0.733
Intron 7 48832 48920 ( 89 n); Pd: 0.988 Pa: 0.739
Exon 8 48921 49007 ( 87 n); Protein 456 484 ( 29 aa); score: 0.699
Intron 8 49008 49201 ( 194 n); Pd: 0.601 Pa: 0.941
Exon 9 49202 49492 ( 291 n); Protein 485 581 ( 97 aa); score: 0.691
MATCH AC137075+ AtPLC2 0.567 1788 1.024 P
PGS_AC137075+_AtPLC2 (46116 46439,47305 47501,47601 47736,47814 48065,48154 48383,48490 48607,48679 48831,48921 49007,49202 49492)
Alignment:
GCGCAAATGG GGACGTACAA GTGCTGCATC TTCTTCACCC GCAGGTTCGC GCTGAGCGAC 46175
A Q M G T Y K C C I F F T R R F A L S D
. | | | | . | | | | +
M S K Q T Y K V C F C F R R R F R Y T A 20
GCGTCCACGC CGGGCGACGT GCGCATGCTG TTCACCCGCC ACGCCGGCGG CGCGCCCTAC 46235
A S T P G D V R M L F T R H A G G A P Y
+ . . | + + + + | + + + . .
S E A P R E I K T I F E K Y S E N G - V 39
ATGGGCATCG ACGAGCTCCG GCGCTACCTC GCCGCCAGCG GGGAGGCCCA CGTCGACGCC 46295
M G I D E L R R Y L A A S G E A H V D A
| + | . | . | + | . + . |
M T V D H L H R F L - I D V Q K Q - D K 57
GACACGGCGG AGCGGATCAT CGACCGGGTC CTGCAGGAGC GCAGCCGCAC CCCGCGCTTC 46355
D T A E R I I D R V L Q E R S R T P R F
| | . + + . . | + .
A T R E D - A Q S I I N S A S - S - L L 74
GGGAAGCCGT CGCTCACCAT CGACGATTTC CAGTACTTCC TCTTCTCCGA GGACCTCAAC 46415
G K P S L T I D D F Q Y F L F S E D L N
+ . | + | | + | | . | |
H R N G L H L D A F F K Y L F G - D N N 93
CCGCCCATCT GCCATTCCAA GGAAGTAAGC AAACTACCCG CTCGATCCCC AATTTCCCAA 46475
P P I C H S K E
| | + . |
P P L A L H K - ...... .......... .......... .......... 100
ATGCTGTTAG ATTCATCGTC ATTCCGTGAT AATCCTGCCG TTGCACAATG CGGTGAAATG 46535
.......... .......... .......... .......... .......... .......... 100
GCGTAATTTG CTAGGATTCA GAAGGGGATT CTTGGGGTTT GTTTAGTTCA CATTAAAATT 46595
.......... .......... .......... .......... .......... .......... 100
AAAAGTTTGG TTAAAATTGG AATGATGTGA CGAAAAGTTA GAAGTTTGTG TGTGCAGGAA 46655
.......... .......... .......... .......... .......... .......... 100
AGTTTTGATG CGATGGAAAA GTTGGAAGTT TGAAGAAAAA AATTAAAACT AAACATGGCT 46715
.......... .......... .......... .......... .......... .......... 100
TTGGTCGGAA CTGCTCTGTA GTGTGGACGT CATTCAAATC TTTATGAAGT ATTTTTTTAA 46775
.......... .......... .......... .......... .......... .......... 100
AGATGGATCA CACATGTGAT TAACATAGTT ATATAAAATT TTGTTAAAAT TTGAAAATGT 46835
.......... .......... .......... .......... .......... .......... 100
AGAATACGAT GATATAAATC ACTATATAAA CATGCAAGTT TAAATTTGAT CCACGCAAAG 46895
.......... .......... .......... .......... .......... .......... 100
AGAAAAAATA TAACCGATTA TGTTTGAGTT GTGGCATTAC TATTTTCTAT CTGGTTCTAT 46955
.......... .......... .......... .......... .......... .......... 100
TAATTTTTTT TCTCCAATTG TAGATCGAAT CAAGCCTTTG TATGTTTGTA CATAGACTTA 47015
.......... .......... .......... .......... .......... .......... 100
TGCTATCGTA ATCTACTCCC ATTTTTTTTG GACGGAGGGA GTATGTTATC AATTTTAGTT 47075
.......... .......... .......... .......... .......... .......... 100
TAATTTTTTT TACAACTATT TGGGTCACAT ACAAATAACT GGCACATATG CACCTAGGTG 47135
.......... .......... .......... .......... .......... .......... 100
TAAAGAAGTC AACATGCAGG TAATGAATTG AATTTCCATA CAACATTCTG CTCTCCTAAG 47195
.......... .......... .......... .......... .......... .......... 100
AAATTACGCT TACAAGTTCA CTTGGATATT GCTAAACTCC ATTTTGATAT TACTTAGTGT 47255
.......... .......... .......... .......... .......... .......... 100
GTACTGAATG ATCTAAGATG TGAGTTGATG GTAGATCTCG TGCTCTCAGG TCCATCACGA 47315
V H H D
| | | |
.......... .......... .......... .......... ......... V H H D 104
CATGAATGCA CCATTATCGC ACTACTTCAT ATACACTGGA CACAACTCGT ATCTGACGGG 47375
M N A P L S H Y F I Y T G H N S Y L T G
| + | | + | | | | | + | | | | | | | | |
M D A P I S H Y F I F T G H N S Y L T G 124
CAATCAACTT AGCAGTGACT GCAGTGATAT TCCCATCATT AAGGCACTGC AAATAGGCGT 47435
N Q L S S D C S D I P I I K A L Q I G V
| | | | | | | | + + | | | | | + | |
N Q L S S D C S E V P I I D A L K K G V 144
CCGTGTAATT GAACTGGACA TGTGGCCAAA TTCTTCTAAA GATGATGTTG ATATTCTCCA 47495
R V I E L D M W P N S S K D D V D I L H
| | | | | | + | | | | + | | | + | + | |
R V I E L D I W P N S N K D D I D V L H 164
TGGAAGGTAT GCATGAGAAT TGCTCACTTG AAGACATTTT TGTTCTGCAC TGGAGGCCAT 47555
G R
|
G M.... .......... .......... .......... .......... .......... 166
TCGATATGCT ATGACCTTAT TCCAAACTAT TTGCTTCTTT GGTAGGACAC TGACTGCCCC 47615
T L T A P
| | | . |
.......... .......... .......... .......... ..... T L T T P 171
AGTATCACTT ATCAAATGCT TGAAATCCAT CAAAGAATAT GCCTTTGTTG CGTCTCCCTA 47675
V S L I K C L K S I K E Y A F V A S P Y
| . | | | | | | + | + + | | . | |
V G L I K C L K A I R A H A F D V S D Y 191
CCCTGTTATT ATAACATTAG AAGACCACCT TACATCTGAT CTTCAGGCGA AAGTAGCTAA 47735
P V I I T L E D H L T S D L Q A K V A K
| | + + | | | | | | | | | | + | | | +
P V V V T L E D H L T P D L Q S K V A E 211
GGTAATTGCA TTTTCCTCGT ATGATCAATA ATTTGGTGCA GTTGATTCTG TTGTAGCTAG 47795
......... .......... .......... .......... .......... .......... 211
TTATGAAATT TTCTTTAGAT GGTTCTTGAA GTATTTGGAG ATACCCTATA TTATCCCGAG 47855
M V L E V F G D T L Y Y P E
| | | + | | + | + |
.......... ........ M V T E I F G E I L F T P P 225
---TCAAAAC ATCTTCAAGA ATTTCCTTCA CCCGAAGCAC TGAGGGGACG TGTCATCCTC 47912
S K H L Q E F P S P E A L R G R V I L
. + | + | | | | | . + | + | + | +
V G E S L K E F P S P N S L K R R I I I 245
TCAACAAAAC CCCCAAAGGA GTACCTTGAA TCAAAAGGTG GTACTATGAA AGACAGAGAC 47972
S T K P P K E Y L E S K G G T M K D R D
| | | | | | | | | | | | +
S T K P P K E Y - - - K E G - - K D V E 260
ATTGAGCCTC AGTTTAGCAA AGGACAAAAT GAAGAAGCTG TCTGGGGAAC AGAAGTCCCA 48032
I E P Q F S K G Q N E E A V W G T E V P
+ | . | . + | | | | | | |
V - V Q K G K D L G D E E V W G R E V P 279
GATATTCAGG ATGAGATGC- AAACCGCCGA CAAGGTTCTA CTGGTTTTAA CATTTGTTGT 48091
D I Q D E M N R R Q
. . + | + +
S F - I Q R - N K S E ...... .......... .......... 288
TTCTTGTTTC TTAGCATATG GTGTATGTCC ATCACTGTTG TATTGGCTTT ATTCCCTAGC 48151
.......... .......... .......... .......... .......... .......... 288
AGCA-T-GAG AATGATATAC TATACACCCA AAGAGATGTG GAAGAAGATG ATGAGAAGAA 48209
A E N D I L Y T Q R D V E E D D E K K
| + + | + . . | + + | | | |
..A - K D D L D G N D D D D D D D D E D K 307
AATGTGCCAG CATCACCCAC TAGAGTATAA ACACCTTATT ACTATTAAGG CAGGAAAGCC 48269
M C Q H H P L E Y K H L I T I K A G K P
+ | + | | | | | . | | | | |
S K I N A P P Q Y K H L I A I H A G K P 327
AAAGGGTGCT GTAGTTGATG CCTTAAAGGG TGATCCAGAT AAAGTTAGAC GCCTCAGTTT 48329
K G A V V D A L K G D P D K V R R L S L
| | . + . + . | | | | | | | | | | | |
K G G I T E C L K V D P D K V R R L S L 347
GAGTGAGCAG GAACTTGCAA AAGTGGCAGC GCATCATGGT CGTAACATCG TGAGGTTCGT 48389
S E Q E L A K V A A H H G R N I V S
| | + + | | . | + . + . | |
S E E Q L E K A A E K Y A K Q I V R...... 365
TTAGCAAATA TACTGAATTT CGTAGCAAAG TATTTTCTAT CATTGCACCA GAGCTCTCTA 48449
.......... .......... .......... .......... .......... .......... 365
TGTCCATTGA CCTTAACTTC ATTCTGTTTA TTCAAAGCAG CTTTACACAT AAAAATCTTC 48509
F T H K N L
| | . | |
.......... .......... .......... .......... F T Q H N L 371
TGAGAATATA CCCAAAGGGC ACTCGCTTCA ATTCTTCGAA CTATAATCCG TTTCTTGGTT 48569
L R I Y P K G T R F N S S N Y N P F L G
| | | | | | | | | . | | | | | | . + |
L R I Y P K G T R V T S S N Y N P L V G 391
GGGTGCATGG TGCACAAATG GTGGCATTTA ATATGCAGGT ACATTTCTAA CATGACACTC 48629
W V H G A Q M V A F N M Q
| | | | | | | | | | | |
W S H G A Q M V A F N M Q .. .......... .......... 404
CTCTGCTACA TCATATTGGC CTGAATGCCT GATACATTTT TCTTCGCAGG GGTATGGAAG 48689
G Y G R
| | | |
.......... .......... .......... .......... ......... G Y G R 408
ATCTCTTTGG CTAATGCACG GATTCTACAA GGCCAACGGT GGCTGCGGTT ATGTGAAGAA 48749
S L W L M H G F Y K A N G G C G Y V K K
| | | | | . | . + + | | | | | | | + | |
S L W L M Q G M F R A N G G C G Y I K K 428
GCCAGATTTC ATGATGCAAA CTTGTCCAGA TGGAAATGTT TTTGACCCGA AAGCAGATTT 48809
P D F M M Q T C P D G N V F D P K A D L
| | . + + + + | . + + | | | | | |
P D L L L K S G S D S D I F D P K A T L 448
ACCTGTGAAG AAAACACTCA AGGTAGGTTT GTGGCATATG TTTCTTCCTT TCATTTTCAT 48869
P V K K T L K
| | | | | +
P V K T T L R ........ .......... .......... .......... 455
CTCTGAAATT CAGGAATCGA GCTACTTACA GCTTGCCTGT TTGTCTACCA GGTCAAAGTA 48929
V K V
| |
.......... .......... .......... .......... .......... . V T V 458
TACATGGGCG AAGGTTGGCA GAGCGACTTC AAGCAGACAT ACTTCGACAC GTATTCCCCT 48989
Y M G E G W Q S D F K Q T Y F D T Y S P
| | | | | | | | + . | + | | | | |
Y M G E G W Y F D F R H T H F D Q Y S P 478
CCAGACTTCT ACGCAAAGGT ACATCGAATT TTACGCTGAT GCCAAACGCC AACAAATTTG 49049
P D F Y A K
| | | | . +
P D F Y T R .. .......... .......... .......... .......... 484
CAAATGCAAA ACGGAGCTTT GAAAAAACAT GTATATATGT ATAACTTTTA CATATGGAGT 49109
.......... .......... .......... .......... .......... .......... 484
GAGATGAAGA CAAACTTTAT ATCAAAATTG TAGAGCTCCA TGAGTTCTAC GACGTTCTTA 49169
.......... .......... .......... .......... .......... .......... 484
TTGACTAGTC CATCGTTCCA TCATCATAAC AGGTGGGCAT TGCCGGGGTT CCGTCGGACT 49229
V G I A G V P S D
| | | | | | | . |
.......... .......... .......... .. V G I A G V P G D 493
CGGTGATGCA GAAGACGAAA GCCGTGGAGG ACAGCTGGGT TCCCGTGTGG GAGGAGGAGT 49289
S V M Q K T K A V E D S W V P V W E E E
+ | | + | | | . + | | + | + | . | + |
T V M K K T K T L E D N W I P A W D E V 513
TCGTGTTCCC GCTGACCGTC CCGGAGATCG CGCTGCTCCG CGTGGAGGTG CACGAGTACG 49349
F V F P L T V P E I A L L R V E V H E Y
| | | | | | | | + | | | | + | | | | |
F E F P L T V P E L A L L R L E V H E Y 533
ACGTGAGCGA G---GACGAC TTCGGCGGGC AGACGGCGCT CCCGGTGTCG GAGCTGCGGC 49406
D V S E D D F G G Q T A L P V S E L R
| + | | | | | | | | | . | | | | |
D M S E K D D F G G Q T C L P V W E L S 553
CGGGGATCCG CACCGTGCCG CTCTTCGACC ACAAGGGGCT CAAGTTCAAG AGCGTCAAGC 49466
P G I R T V P L F D H K G L K F K S V K
| | | . | | . . | | | + | | | |
E G I R A F P L H S R K G E K Y K S V K 573
TCCTCATGCG GTTCGAGTTC GTCTAG 49492
L L M R F E F V *
| | + + | | |
L L V K V E F V * 582
<A NAME="PPGS1@AC137075-45000-50999"></A>
********************************************************************************
Query protein sequence 1 (File: <A HREF="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Search&db=Protein&term=OsPLCa&doptcmdl=GenPept" TARGET="PROTEIN SEQUENCE SEARCH">OsPLCa</A>)
1 MGTYKCCIFF TRRFALSDAS TPGDVRMLFT RHAGGAPYMG IDELRRYLAA SGEAHVDADT
61 AERIIDRVLQ ERSRTPRFGK PSLTIDDFQY FLFSEDLNPP ICHSKEVHHD MNAPLSHYFI
121 YTGHNSYLTG NQLSSDCSDI PIIKALQIGV RVIELDMWPN SSKDDVDILH GRTLTAPVSL
181 IKCLKSIKEY AFVASPYPVI ITLEDHLTSD LQAKVAKMVL EVFGDTLYYP ESKHLQEFPS
241 PEALRGRVIL STKPPKEYLE SKGGTMKDRD IEPQFSKGQN EEAVWGTEVP DIQDEMQTAD
301 KVLLHENDIL YTQRDVEEDD EKKMCQHHPL EYKHLITIKA GKPKGAVVDA LKGDPDKVRR
361 LSLSEQELAK VAAHHGRNIV SFTHKNLLRI YPKGTRFNSS NYNPFLGWVH GAQMVAFNMQ
421 GYGRSLWLMH GFYKANGGCG YVKKPDFMMQ TCPDGNVFDP KADLPVKKTL KVKVYMGEGW
481 QSDFKQTYFD TYSPPDFYAK VGIAGVPSDS VMQKTKAVED SWVPVWEEEF VFPLTVPEIA
541 LLRVEVHEYD VSEDDFGGQT ALPVSELRPG IRTVPLFDHK GLKFKSVKLL MRFEFV-
Predicted gene structure (within gDNA segment 45001 to 51000):
Exon 1 46122 46439 ( 318 n); Protein 1 106 ( 106 aa); score: 1.000
Intron 1 46440 47304 ( 865 n); Pd: 0.852 Pa: 0.016
Exon 2 47305 47501 ( 197 n); Protein 107 172 ( 66 aa); score: 1.000
Intron 2 47502 47600 ( 99 n); Pd: 0.981 Pa: 0.943
Exon 3 47601 47736 ( 136 n); Protein 173 217 ( 45 aa); score: 1.000
Intron 3 47737 47813 ( 77 n); Pd: 0.967 Pa: 0.462
Exon 4 47814 48074 ( 261 n); Protein 218 304 ( 87 aa); score: 1.000
Intron 4 48075 48153 ( 79 n); Pd: 0.950 Pa: 0.957
Exon 5 48154 48383 ( 230 n); Protein 305 381 ( 77 aa); score: 1.000
Intron 5 48384 48489 ( 106 n); Pd: 0.981 Pa: 0.477
Exon 6 48490 48607 ( 118 n); Protein 382 420 ( 39 aa); score: 1.000
Intron 6 48608 48678 ( 71 n); Pd: 0.976 Pa: 0.976
Exon 7 48679 48831 ( 153 n); Protein 421 471 ( 51 aa); score: 1.000
Intron 7 48832 48920 ( 89 n); Pd: 0.988 Pa: 0.739
Exon 8 48921 49007 ( 87 n); Protein 472 500 ( 29 aa); score: 1.000
Intron 8 49008 49201 ( 194 n); Pd: 0.601 Pa: 0.941
Exon 9 49202 49492 ( 291 n); Protein 501 596 ( 96 aa); score: 1.000
MATCH AC137075+ OsPLCa 1.000 1791 1.000 P
PGS_AC137075+_OsPLCa (46122 46439,47305 47501,47601 47736,47814 48074,48154 48383,48490 48607,48679 48831,48921 49007,49202 49492)
Alignment:
ATGGGGACGT ACAAGTGCTG CATCTTCTTC ACCCGCAGGT TCGCGCTGAG CGACGCGTCC 46181
M G T Y K C C I F F T R R F A L S D A S
| | | | | | | | | | | | | | | | | | | |
M G T Y K C C I F F T R R F A L S D A S 20
ACGCCGGGCG ACGTGCGCAT GCTGTTCACC CGCCACGCCG GCGGCGCGCC CTACATGGGC 46241
T P G D V R M L F T R H A G G A P Y M G
| | | | | | | | | | | | | | | | | | | |
T P G D V R M L F T R H A G G A P Y M G 40
ATCGACGAGC TCCGGCGCTA CCTCGCCGCC AGCGGGGAGG CCCACGTCGA CGCCGACACG 46301
I D E L R R Y L A A S G E A H V D A D T
| | | | | | |