Posts

HAMES_DBI.A12

Image
 Project Proposal      Hypothesis: By regulating the gene SLC5A1, we can also regulate the expression of the sodium-glucose cotransport 1 and the genetic disorder GGM. From this, we are able to induce GGM into a mouse through editing the gene. To test the expression of the mutation, we will feed the mouse food that contains glucose/galactose. If the mouse experiences diarrhea and excessive dehydration, we can conclude the gene editing as successful.       Introduction: For this semester, the digestive protein I chose all the way back in the beginning of February was sodium-glucose cotransport 1. Over the course of the semester, I learned a lot about this protein and the gene that is associated with it by traveling and mining information through different databases. Sodium-glucose cotransport 1 (SGLT1) is mainly found in kidney proximal tubes because it regulates sodium/sugar uptake levels in intestinal cells. The mutation in the gene that encodes ...

HAMES_DBI.A11

Image
 UCSC Genome Browser Part 3 6. a) The Repeat Masker screens DNA sequences for repeats and low complexity DNA sequences. b) SINE, LINE, LTR, DNA, Simple, Low Complexity, Satellite, RNA, Other, Unknown c) A densely shaded line represents a low number of combined base mismatch, base deletion, and base insertion associations. 10. a) LINE has the highest amount of associated repeats because it has the most amount of lighter shaded lines. b) LINE and SINE have the largest present because they have the most dense lines. c) It might be useful to see the frequency of SINES within a gene to see how well a gene might be able to be mutated or alter the expression. d) Knowing the location of microsatellites can be useful because they can be used in tracking point mutations, DNA repair, and replication errors in the gene.  e) Some assays that'll help a scientist with expression/gene regulation analysis would be PCR, CRISPR, RNA sequencing, and microarrays.

HAMES_DBI.A10

 ORegAnno in the UCSC Browser a) 1. I. Oreg #: OREG1768905 II. Region type: Transcription Factor Binding Site III. Transcription factors: MITF Ensembl - ENST00000394351 2. I. Oreg #: OREG1626239 II. Region type: Transcription Factor Binding Site III. Transcription factors: FOS Ensembl - ENST00000303562 3. I. Oreg #: OREG1939402 II. Region type: Transcription Factor Binding Site III. Transcription factors: TP53 Ensembl - ENST00000504290 b) I. Orange  II. It is important to know the regulatory elements in a gene because those elements regulate what parts of the DNA are replicated and how the gene/DNA might be manipulated to make a certain protein.  c) I think the first assignment was the most interesting to me because I was able to choose my proteins among 26 other ones, and find a lot of information about a new protein I had never heard of. I thought it was fascinating to dive into and learn about this new protein and new world that is database research. Some potentia...

HAMES_DBI.A9

Image
 UCSC Genome Browser Part 2 6. I. Description of the data: The NIH Genotype-Tissue Expression project is used to study genetic  variation and gene expression in human tissues. The track shows median gene expression in 52 tissues and 2 cell lines based on data from 17,382 tissues obtained from 948 post-mortem adults. II. Tissue samples were obtained using the GTEx operating procedures of tissue collection and specimens were reviewd by pathologists to verify the organ source. The Qiagen PAXgene non-formalin tissue preservation product was used to stabilize the specimens. The RNA-seq was performed by GTEx and were aligned to the human genome using STAR v2.5.3a assisted by the GENCODE 26 transcriptome defintion. Gene annotations were produced using a custome isoform collapsing procedure that excluded and then included extrons/introns. Gene expression levels were called via the RNA-SeQC tool after filtering it by unique mapping, proper pairing, and exon overlap.  III. The goal...

HAMES_DBI.A8

Image
 UCSC Genome Browser Gene: SLC5A1  Chromosome and position: chr22:32,043,261-32,113,029- 69,769 bp When NCBI RefSeq is put to full in the "Genes and Gene Predictions" section, the gene is mapped out and shows the introns associated with the gene.  When you click the track heading, you condense the newly added RefSeq track and similar subtracks. Clicking on the gene name sends you to an information page about the gene where you can find a summary of the gene, description, data displays, and methods of review for the data.  Browser image:

HAMES_DBI.A7

Image
 CollecTF Database Genome name: Human fecal microbial communities from Cork, Ireland - EM173 BIN ID: 2042536002_2 Ecosystem details: Host associated; human; digestive system; large intestine; fecal Genome name: Human fecal microbial communities from Orebro University Hospital, Sweden - Sample 10368 BIN ID: 2051223000_3 Ecosystem details: Host associated; human; digestive system; large intestine; fecal Genome name:  Human fecal microbial communities from Orebro University Hospital, Sweden - Sample 10368 BIN ID: 2051223000_4 Ecosystem details: Host associated; human; digestive system; large intestine; fecal Resulting transcription factors: CsoR Motif structure: Inverted repeat GC-content: 69.23% Regulatory mode: Activation: 0% Repression: 100% Dual: 0% Not specified: 0% TF conformation:  Monomer: 0% Dimer: 0% Tetramer: 100% Other: 0% Not specified: 0% Binding site type: Motif-associated: 1 Variable-motif-associated: 0 Non-motif-associated: 0 This transcription factor is i...

HAMES_DBI.A6

  Varsite Name: sodium/glucose cotransporter 1 ID number: P13866 Uniprot ID: P13866 Tissue details: Lines the absorptive cells in the small intestine and proximal tubes in the kidney in order to uptake glucose in the cells. Diseases: GGM (Congenital galactose/glucose malabsorption) Tissue specificity: Expressed mainly in the intestine and kidney Cofactors: cellular hexose transport, d efective SLC5A1 causes congenital glucose/galactose malabsorption (GGM), intestinal hexose transport Organs/systems of high level of expression: liver, heart, mouth Organs/system of low level of expression: gonads, brain, pituitary gland I. Three aspects that I am still curious about with my protein are the specifics of the disease that is caused by its mutation- GGM, it's role in other organisms besides humans, and more information on how it can be regulated by drugs and medication. II. A potential research project would be looking into gene manipulation/editing and gene expression of SLC5A1 through ...