y PlantCAZyme

CAZyme Information

Basic Information
SpeciesArabidopsis thaliana
Cazyme IDAT5G26000.1
FamilyGH1
Protein PropertiesLength: 542 Molecular Weight: 61132.8 Isoelectric Point: 5.7031
ChromosomeChromosome/Scaffold: 5 Start: 9079505 End: 9082384
DescriptionOs4bglu12 - beta-glucosidase, exo-beta-glucanase, expressed
View CDS
External Links
TAIR
Geo Profiles
ATTED-II
NCBI Taxonomy
Plaza
SIGnAL
CAZyDB
Entrez Gene
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH1415120
  GNFEKGFIFGVASSAYQVEGGRGRGLNVWDSFTHRFPEKGGADLGNGDTTCDSYTLWQKDIDVMDELNSTGYRFSIAWSRLLPKGKRSRGVNPGAIKYYN
  GLIDGLVAKNMTPFVTLFHWDLPQTLQDEYNGFLNKTIVDDFKDYADLCFELFGDRVKNWITINQLYTVPTRGYALGTDAPGRCSPKIDVRCPGGNSSTE
  PYIVAHNQLLAHAAAVDVYRTKYKDDQKGMIGPVMITRWFLPFDHSQESKDATERAKIFFHGWFMGPLTEGKYPDIMREYVGDRLPEFSETEAALVKGSY
  DFLGLNYYVTQYAQNNQTIVPSDVHTALMDSRTTLTSKNATGHAPGPPFNAASYYYPKGIYYVMDYFKTTYGDPLIYVTENGFSTPGDEDFEKATADYKR
  IDYLCSHLCFLSKVIKEKNVNVKGYFAWSLGDNYEFCNGFTVRFGLSYVDFANITGDRDLKASGKWFQKFIN
Full Sequence
Protein Sequence     Length: 542     Download
MKLLMLAFVF LLALATCKGD EFVCEENEPF TCNQTKLFNS GNFEKGFIFG VASSAYQVEG    60
GRGRGLNVWD SFTHRFPEKG GADLGNGDTT CDSYTLWQKD IDVMDELNST GYRFSIAWSR    120
LLPKGKRSRG VNPGAIKYYN GLIDGLVAKN MTPFVTLFHW DLPQTLQDEY NGFLNKTIVD    180
DFKDYADLCF ELFGDRVKNW ITINQLYTVP TRGYALGTDA PGRCSPKIDV RCPGGNSSTE    240
PYIVAHNQLL AHAAAVDVYR TKYKDDQKGM IGPVMITRWF LPFDHSQESK DATERAKIFF    300
HGWFMGPLTE GKYPDIMREY VGDRLPEFSE TEAALVKGSY DFLGLNYYVT QYAQNNQTIV    360
PSDVHTALMD SRTTLTSKNA TGHAPGPPFN AASYYYPKGI YYVMDYFKTT YGDPLIYVTE    420
NGFSTPGDED FEKATADYKR IDYLCSHLCF LSKVIKEKNV NVKGYFAWSL GDNYEFCNGF    480
TVRFGLSYVD FANITGDRDL KASGKWFQKF INVTDEDSTN QDLLRSSVSS KNRDRKSLAD    540
A*                                                                   600
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
COG2723BglB2.0e-11642518489+
PLN02849PLN028494.0e-11738530502+
PLN02814PLN028143.0e-12038525494+
TIGR03356BGL9.0e-12445507473+
pfam00232Glyco_hydro_1039512482+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005773vacuole
GO:0005777peroxisome
GO:0005975carbohydrate metabolic process
GO:0008422beta-glucosidase activity
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankAAL06896.1015411541AT5g26000/T1N24_7 [Arabidopsis thaliana]
GenBankAAL25596.1015411541AT5g26000/T1N24_7 [Arabidopsis thaliana]
EMBLCAH40799.10184971480thioglucoside glucohydrolase [Arabidopsis thaliana]
EMBLCAH40804.10435191477thioglucoside glucohydrolase [Arabidopsis thaliana]
RefSeqNP_851077.1015411541TGG1 (THIOGLUCOSIDE GLUCOHYDROLASE 1); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1myr_A0205122500A Chain A, Myrosinase From Sinapis Alba
PDB2wxd_M0205122500A Chain A, Myrosinase From Sinapis Alba
PDB1w9d_M0205122500A Chain A, Myrosinase From Sinapis Alba
PDB1w9b_M0205122500A Chain A, Myrosinase From Sinapis Alba
PDB1e73_M0205122500A Chain A, Myrosinase From Sinapis Alba
Signal Peptide
Cleavage Site
19
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
EG525046322643850
EG5241082952345280
CB264645290163050
EG494673269162840
BU6350872552805340
Orthologous Group
SpeciesID
Arabidopsis lyrata917734489446
Arabidopsis thalianaAT5G25980.1AT5G48375.1AT5G25980.2AT1G51490.1AT5G26000.2
AT5G25980.3
Brassica rapaBra016676Bra039823Bra039705Bra039824Bra004012
Bra014287Bra038094Bra023838Bra032343Bra036914
Bra020523.35.161Bra020523.35.161Bra020549
Carica papayaevm.model.supercontig_17.152
Capsella rubellaCarubv10000656mCarubv10007356m.543.827Carubv10007356m.543.827Carubv10006917mCarubv10022237m
Carubv10007356m.45.502Carubv10007356m.45.502
Linum usitatissimumLus10012868
Thellungiella halophilaThhalv10002474mThhalv10004165mThhalv10001184mThhalv10002471mThhalv10002470m
Thhalv10003945mThhalv10003954mThhalv10012086mThhalv10000681mThhalv10011390m
Thhalv10012108m
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny