PCP16_ARATH
ID PCP16_ARATH Reviewed; 108 AA.
AC Q5S502; Q5XVJ0; Q9C6D4;
DT 10-MAY-2017, integrated into UniProtKB/Swiss-Prot.
DT 21-DEC-2004, sequence version 1.
DT 03-AUG-2022, entry version 100.
DE RecName: Full=Precursor of CEP16 {ECO:0000305};
DE Short=PCEP16 {ECO:0000305};
DE Contains:
DE RecName: Full=C-terminally encoded peptide 16 {ECO:0000305};
DE Short=CEP16 {ECO:0000305};
DE Flags: Precursor;
GN Name=CEP16 {ECO:0000305};
GN OrderedLocusNames=At1g49800 {ECO:0000312|Araport:AT1G49800};
GN ORFNames=F10F5.5 {ECO:0000312|EMBL:AAG51771.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA / MRNA].
RC STRAIN=cv. Columbia;
RA Underwood B.A., Xiao Y.-L., Moskal W.A. Jr., Monaghan E.L., Wang W.,
RA Redman J.C., Wu H.C., Utterback T., Town C.D.;
RL Submitted (OCT-2004) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RX PubMed=16244158; DOI=10.1104/pp.105.063479;
RA Xiao Y.-L., Smith S.R., Ishmael N., Redman J.C., Kumar N., Monaghan E.L.,
RA Ayele M., Haas B.J., Wu H.C., Town C.D.;
RT "Analysis of the cDNAs of hypothetical genes on Arabidopsis chromosome 2
RT reveals numerous transcript variants.";
RL Plant Physiol. 139:1323-1337(2005).
CC -!- FUNCTION: Extracellular signaling peptide that may regulate primary
CC root growth rate and systemic nitrogen (N)-demand signaling.
CC {ECO:0000250|UniProtKB:Q8L8Y3}.
CC -!- SUBUNIT: Interacts with CEP receptors (e.g. CEPR1 and CEPR2).
CC {ECO:0000250|UniProtKB:Q8L8Y3}.
CC -!- SUBCELLULAR LOCATION: [C-terminally encoded peptide 16]: Secreted,
CC extracellular space, apoplast {ECO:0000250|UniProtKB:O80460}.
CC Note=Accumulates in xylem sap. {ECO:0000250|UniProtKB:O80460}.
CC -!- PTM: The mature small signaling peptide is generated by proteolytic
CC processing of the longer precursor. {ECO:0000250|UniProtKB:Q8L8Y3}.
CC -!- SIMILARITY: Belongs to the C-terminally encoded plant signaling peptide
CC (CEP) family. {ECO:0000305}.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAG51771.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC079674; AAG51771.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002684; AEE32478.1; -; Genomic_DNA.
DR EMBL; AY735532; AAU44402.1; -; mRNA.
DR EMBL; AY773823; AAV63852.1; -; Genomic_DNA.
DR PIR; G96534; G96534.
DR RefSeq; NP_175402.2; NM_103867.3.
DR AlphaFoldDB; Q5S502; -.
DR SMR; Q5S502; -.
DR STRING; 3702.AT1G49800.1; -.
DR PaxDb; Q5S502; -.
DR PRIDE; Q5S502; -.
DR EnsemblPlants; AT1G49800.1; AT1G49800.1; AT1G49800.
DR GeneID; 841403; -.
DR Gramene; AT1G49800.1; AT1G49800.1; AT1G49800.
DR KEGG; ath:AT1G49800; -.
DR Araport; AT1G49800; -.
DR TAIR; locus:2007238; AT1G49800.
DR eggNOG; ENOG502R1TS; Eukaryota.
DR HOGENOM; CLU_2284599_0_0_1; -.
DR InParanoid; Q5S502; -.
DR OMA; MSSFKPV; -.
DR OrthoDB; 1565384at2759; -.
DR PhylomeDB; Q5S502; -.
DR PRO; PR:Q5S502; -.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; Q5S502; baseline and differential.
DR GO; GO:0048046; C:apoplast; ISS:UniProtKB.
DR GO; GO:0005179; F:hormone activity; ISS:UniProtKB.
DR GO; GO:0045087; P:innate immune response; IEA:InterPro.
DR GO; GO:1902025; P:nitrate import; ISS:UniProtKB.
DR GO; GO:2000280; P:regulation of root development; ISS:UniProtKB.
DR InterPro; IPR044700; PIP2/PIPL1.
DR PANTHER; PTHR34663; PTHR34663; 1.
PE 3: Inferred from homology;
KW Apoplast; Developmental protein; Glycoprotein; Hormone; Hydroxylation;
KW Reference proteome; Secreted; Signal.
FT SIGNAL 1..27
FT /evidence="ECO:0000255"
FT PROPEP 28..92
FT /evidence="ECO:0000305"
FT /id="PRO_0000440016"
FT PEPTIDE 93..108
FT /note="C-terminally encoded peptide 16"
FT /evidence="ECO:0000250|UniProtKB:Q058G9"
FT /id="PRO_0000440017"
FT REGION 76..108
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 102
FT /note="Hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:Q058G9"
FT MOD_RES 104
FT /note="Hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:Q8L8Y3"
FT CARBOHYD 50
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00498"
FT CARBOHYD 98
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00498"
SQ SEQUENCE 108 AA; 11689 MW; 0391C324CC8D28E5 CRC64;
MVMAKNLTKF YVVFLVVLMM VVSLLLAIEG RPVKDSSRSL TQMRDSSMFN GSVIMSSFKP
VESSVKDLSW LATVKQSGPS PGVGHHRAKG YKMFGRANDS GPSPGVGH