Last updated: 2025-08-13
Checks: 7 0
Knit directory: ATAC_learning/
This reproducible R Markdown analysis was created with workflowr (version 1.7.1). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.
Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.
Great job! The global environment was empty. Objects defined in the global environment can affect the analysis in your R Markdown file in unknown ways. For reproduciblity it’s best to always run the code in an empty environment.
The command set.seed(20231016)
was run prior to running
the code in the R Markdown file. Setting a seed ensures that any results
that rely on randomness, e.g. subsampling or permutations, are
reproducible.
Great job! Recording the operating system, R version, and package versions is critical for reproducibility.
Nice! There were no cached chunks for this analysis, so you can be confident that you successfully produced the results during this run.
Great job! Using relative paths to the files within your workflowr project makes it easier to run your code on other machines.
Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.
The results in this page were generated with repository version e9cbf29. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.
Note that you need to be careful to ensure that all relevant files for
the analysis have been committed to Git prior to generating the results
(you can use wflow_publish
or
wflow_git_commit
). workflowr only checks the R Markdown
file, but you know if there are other scripts or data files that it
depends on. Below is the status of the Git repository when the results
were generated:
Ignored files:
Ignored: .RData
Ignored: .Rhistory
Ignored: .Rproj.user/
Ignored: analysis/H3K27ac_integration_noM.Rmd
Ignored: data/ACresp_SNP_table.csv
Ignored: data/ARR_SNP_table.csv
Ignored: data/All_merged_peaks.tsv
Ignored: data/CAD_gwas_dataframe.RDS
Ignored: data/CTX_SNP_table.csv
Ignored: data/Collapsed_expressed_NG_peak_table.csv
Ignored: data/DEG_toplist_sep_n45.RDS
Ignored: data/FRiP_first_run.txt
Ignored: data/Final_four_data/
Ignored: data/Frip_1_reads.csv
Ignored: data/Frip_2_reads.csv
Ignored: data/Frip_3_reads.csv
Ignored: data/Frip_4_reads.csv
Ignored: data/Frip_5_reads.csv
Ignored: data/Frip_6_reads.csv
Ignored: data/GO_KEGG_analysis/
Ignored: data/HF_SNP_table.csv
Ignored: data/Ind1_75DA24h_dedup_peaks.csv
Ignored: data/Ind1_TSS_peaks.RDS
Ignored: data/Ind1_firstfragment_files.txt
Ignored: data/Ind1_fragment_files.txt
Ignored: data/Ind1_peaks_list.RDS
Ignored: data/Ind1_summary.txt
Ignored: data/Ind2_TSS_peaks.RDS
Ignored: data/Ind2_fragment_files.txt
Ignored: data/Ind2_peaks_list.RDS
Ignored: data/Ind2_summary.txt
Ignored: data/Ind3_TSS_peaks.RDS
Ignored: data/Ind3_fragment_files.txt
Ignored: data/Ind3_peaks_list.RDS
Ignored: data/Ind3_summary.txt
Ignored: data/Ind4_79B24h_dedup_peaks.csv
Ignored: data/Ind4_TSS_peaks.RDS
Ignored: data/Ind4_V24h_fraglength.txt
Ignored: data/Ind4_fragment_files.txt
Ignored: data/Ind4_fragment_filesN.txt
Ignored: data/Ind4_peaks_list.RDS
Ignored: data/Ind4_summary.txt
Ignored: data/Ind5_TSS_peaks.RDS
Ignored: data/Ind5_fragment_files.txt
Ignored: data/Ind5_fragment_filesN.txt
Ignored: data/Ind5_peaks_list.RDS
Ignored: data/Ind5_summary.txt
Ignored: data/Ind6_TSS_peaks.RDS
Ignored: data/Ind6_fragment_files.txt
Ignored: data/Ind6_peaks_list.RDS
Ignored: data/Ind6_summary.txt
Ignored: data/Knowles_4.RDS
Ignored: data/Knowles_5.RDS
Ignored: data/Knowles_6.RDS
Ignored: data/LiSiLTDNRe_TE_df.RDS
Ignored: data/MI_gwas.RDS
Ignored: data/SNP_GWAS_PEAK_MRC_id
Ignored: data/SNP_GWAS_PEAK_MRC_id.csv
Ignored: data/SNP_gene_cat_list.tsv
Ignored: data/SNP_supp_schneider.RDS
Ignored: data/TE_info/
Ignored: data/TFmapnames.RDS
Ignored: data/all_TSSE_scores.RDS
Ignored: data/all_four_filtered_counts.txt
Ignored: data/aln_run1_results.txt
Ignored: data/anno_ind1_DA24h.RDS
Ignored: data/anno_ind4_V24h.RDS
Ignored: data/annotated_gwas_SNPS.csv
Ignored: data/background_n45_he_peaks.RDS
Ignored: data/cardiac_muscle_FRIP.csv
Ignored: data/cardiomyocyte_FRIP.csv
Ignored: data/col_ng_peak.csv
Ignored: data/cormotif_full_4_run.RDS
Ignored: data/cormotif_full_4_run_he.RDS
Ignored: data/cormotif_full_6_run.RDS
Ignored: data/cormotif_full_6_run_he.RDS
Ignored: data/cormotif_probability_45_list.csv
Ignored: data/cormotif_probability_45_list_he.csv
Ignored: data/cormotif_probability_all_6_list.csv
Ignored: data/cormotif_probability_all_6_list_he.csv
Ignored: data/datasave.RDS
Ignored: data/embryo_heart_FRIP.csv
Ignored: data/enhancer_list_ENCFF126UHK.bed
Ignored: data/enhancerdata/
Ignored: data/filt_Peaks_efit2.RDS
Ignored: data/filt_Peaks_efit2_bl.RDS
Ignored: data/filt_Peaks_efit2_n45.RDS
Ignored: data/first_Peaksummarycounts.csv
Ignored: data/first_run_frag_counts.txt
Ignored: data/full_bedfiles/
Ignored: data/gene_ref.csv
Ignored: data/gwas_1_dataframe.RDS
Ignored: data/gwas_2_dataframe.RDS
Ignored: data/gwas_3_dataframe.RDS
Ignored: data/gwas_4_dataframe.RDS
Ignored: data/gwas_5_dataframe.RDS
Ignored: data/high_conf_peak_counts.csv
Ignored: data/high_conf_peak_counts.txt
Ignored: data/high_conf_peaks_bl_counts.txt
Ignored: data/high_conf_peaks_counts.txt
Ignored: data/hits_files/
Ignored: data/hyper_files/
Ignored: data/hypo_files/
Ignored: data/ind1_DA24hpeaks.RDS
Ignored: data/ind1_TSSE.RDS
Ignored: data/ind2_TSSE.RDS
Ignored: data/ind3_TSSE.RDS
Ignored: data/ind4_TSSE.RDS
Ignored: data/ind4_V24hpeaks.RDS
Ignored: data/ind5_TSSE.RDS
Ignored: data/ind6_TSSE.RDS
Ignored: data/initial_complete_stats_run1.txt
Ignored: data/left_ventricle_FRIP.csv
Ignored: data/median_24_lfc.RDS
Ignored: data/median_3_lfc.RDS
Ignored: data/mergedPeads.gff
Ignored: data/mergedPeaks.gff
Ignored: data/motif_list_full
Ignored: data/motif_list_n45
Ignored: data/motif_list_n45.RDS
Ignored: data/multiqc_fastqc_run1.txt
Ignored: data/multiqc_fastqc_run2.txt
Ignored: data/multiqc_genestat_run1.txt
Ignored: data/multiqc_genestat_run2.txt
Ignored: data/my_hc_filt_counts.RDS
Ignored: data/my_hc_filt_counts_n45.RDS
Ignored: data/n45_bedfiles/
Ignored: data/n45_files
Ignored: data/other_papers/
Ignored: data/peakAnnoList_1.RDS
Ignored: data/peakAnnoList_2.RDS
Ignored: data/peakAnnoList_24_full.RDS
Ignored: data/peakAnnoList_24_n45.RDS
Ignored: data/peakAnnoList_3.RDS
Ignored: data/peakAnnoList_3_full.RDS
Ignored: data/peakAnnoList_3_n45.RDS
Ignored: data/peakAnnoList_4.RDS
Ignored: data/peakAnnoList_5.RDS
Ignored: data/peakAnnoList_6.RDS
Ignored: data/peakAnnoList_Eight.RDS
Ignored: data/peakAnnoList_full_motif.RDS
Ignored: data/peakAnnoList_n45_motif.RDS
Ignored: data/siglist_full.RDS
Ignored: data/siglist_n45.RDS
Ignored: data/summarized_peaks_dataframe.txt
Ignored: data/summary_peakIDandReHeat.csv
Ignored: data/test.list.RDS
Ignored: data/testnames.txt
Ignored: data/toplist_6.RDS
Ignored: data/toplist_full.RDS
Ignored: data/toplist_full_DAR_6.RDS
Ignored: data/toplist_n45.RDS
Ignored: data/trimmed_seq_length.csv
Ignored: data/unclassified_full_set_peaks.RDS
Ignored: data/unclassified_n45_set_peaks.RDS
Ignored: data/xstreme/
Untracked files:
Untracked: RNA_seq_integration.Rmd
Untracked: Rplot.pdf
Untracked: Sig_meta
Untracked: analysis/.gitignore
Untracked: analysis/Cormotif_analysis_testing diff.Rmd
Untracked: analysis/Diagnosis-tmm.Rmd
Untracked: analysis/Expressed_RNA_associations.Rmd
Untracked: analysis/IF_counts_20x.Rmd
Untracked: analysis/LFC_corr.Rmd
Untracked: analysis/SVA.Rmd
Untracked: analysis/Tan2020.Rmd
Untracked: analysis/making_master_peaks_list.Rmd
Untracked: analysis/my_hc_filt_counts.csv
Untracked: code/Concatenations_for_export.R
Untracked: code/IGV_snapshot_code.R
Untracked: code/LongDARlist.R
Untracked: code/just_for_Fun.R
Untracked: my_plot.pdf
Untracked: my_plot.png
Untracked: output/cormotif_probability_45_list.csv
Untracked: output/cormotif_probability_all_6_list.csv
Untracked: setup.RData
Unstaged changes:
Modified: ATAC_learning.Rproj
Modified: analysis/AC_shared_analysis.Rmd
Modified: analysis/AF_HF_SNPs.Rmd
Modified: analysis/Cardiotox_SNPs.Rmd
Modified: analysis/Cormotif_analysis.Rmd
Modified: analysis/DEG_analysis.Rmd
Modified: analysis/DOX_DAR_heatmap.Rmd
Modified: analysis/Figure_4.Rmd
Modified: analysis/H3K27ac_integration.Rmd
Modified: analysis/Jaspar_motif.Rmd
Modified: analysis/Jaspar_motif_ff.Rmd
Modified: analysis/SNP_TAD_peaks.Rmd
Modified: analysis/Supp_Fig_12-19.Rmd
Modified: analysis/TE_analysis_ALL_DAR.Rmd
Modified: analysis/TE_analysis_norm.Rmd
Modified: analysis/final_four_analysis.Rmd
Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.
These are the previous versions of the repository in which changes were
made to the R Markdown
(analysis/Jaspar_motif_DAR_paper.Rmd
) and HTML
(docs/Jaspar_motif_DAR_paper.html
) files. If you’ve
configured a remote Git repository (see ?wflow_git_remote
),
click on the hyperlinks in the table below to view the files as they
were in that past version.
File | Version | Author | Date | Message |
---|---|---|---|---|
Rmd | e9cbf29 | reneeisnowhere | 2025-08-13 | wflow_publish("analysis/Jaspar_motif_DAR_paper.Rmd") |
library(tidyverse)
library(cowplot)
library(kableExtra)
library(broom)
library(RColorBrewer)
library(ChIPseeker)
library("TxDb.Hsapiens.UCSC.hg38.knownGene")
library("org.Hs.eg.db")
library(rtracklayer)
library(edgeR)
library(ggfortify)
library(limma)
library(readr)
library(BiocGenerics)
library(gridExtra)
library(VennDiagram)
library(scales)
library(Cormotif)
library(BiocParallel)
library(ggpubr)
library(devtools)
library(JASPAR2022)
# library(TFBSTools)
library(MotifDb)
library(BSgenome.Hsapiens.UCSC.hg38)
library(data.table)
library(universalmotif)
library(ggseqlogo)
# library(motifmatchr)
library(gridExtra)
ER_rat=1.25
toptable_results <- readRDS("data/Final_four_data/re_analysis/Toptable_results.RDS")
all_results <- toptable_results %>%
imap(~ .x %>% tibble::rownames_to_column(var = "rowname") %>%
mutate(source = .y)) %>%
bind_rows()
toplistall_RNA <- readRDS("data/other_papers/toplistall_RNA.RDS") %>%
mutate(logFC = logFC*(-1))
peakAnnoList_DOX_DAR <- readRDS("data/Final_four_data/re_analysis/DOX_DAR_annotated_peaks_chipannno.RDS")
Assigned_genes_toPeak <- peakAnnoList_DOX_DAR$DOX_24 %>% as.data.frame() %>%
dplyr::select(mcols.genes,annotation, geneId, distanceToTSS) %>%
dplyr::rename("Peakid"=mcols.genes)
RNA_results <-
toplistall_RNA %>%
dplyr::select(time:logFC) %>%
tidyr::unite("sample",time, id) %>%
pivot_wider(., id_cols = c(ENTREZID,SYMBOL),names_from = sample, values_from = logFC) %>%
rename_with(~ str_replace(., "hours", "RNA"))
RNA_all_expressed <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="DOX") %>%
dplyr::filter(time=="24_hours") %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_24_DOX_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="DOX") %>%
dplyr::filter(time=="24_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_3_DOX_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="DOX") %>%
dplyr::filter(time=="3_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_24_EPI_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="EPI") %>%
dplyr::filter(time=="24_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_3_EPI_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="EPI") %>%
dplyr::filter(time=="3_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_24_DNR_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="DNR") %>%
dplyr::filter(time=="24_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_3_DNR_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="DNR") %>%
dplyr::filter(time=="3_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_24_MTX_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="MTX") %>%
dplyr::filter(time=="24_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
RNA_3_MTX_DEG <-toplistall_RNA %>%
dplyr::select(time:logFC,adj.P.Val) %>%
dplyr::filter(id=="MTX") %>%
dplyr::filter(time=="3_hours"& adj.P.Val<0.05) %>%
tidyr::unite("sample",time, id) %>%
dplyr::select(ENTREZID, SYMBOL)
Peak_gene_RNA_LFC <- Assigned_genes_toPeak %>%
left_join(., RNA_results, by =c("geneId"="ENTREZID"))
entrez_ids <- Assigned_genes_toPeak$geneId
gene_info <- AnnotationDbi::select(
org.Hs.eg.db,
keys = entrez_ids,
columns = c("SYMBOL"),
keytype = "ENTREZID"
)
gene_info_collapsed <- gene_info %>%
group_by(ENTREZID) %>%
summarise(SYMBOL = paste(unique(SYMBOL), collapse = ","), .groups = "drop")
DOX_24_DAR <- all_results %>%
dplyr::filter(source =="DOX_24") %>%
dplyr::filter(adj.P.Val<0.05) %>%
distinct(genes) %>%
dplyr::rename("Peakid"=genes)
DOX_3_DAR <- all_results %>%
dplyr::filter(source =="DOX_3") %>%
dplyr::filter(adj.P.Val<0.05) %>%
distinct(genes) %>%
dplyr::rename("Peakid"=genes)
DOX_24_DAR_up <- all_results %>%
dplyr::filter(source =="DOX_24") %>%
dplyr::filter(adj.P.Val<0.05) %>%
dplyr::filter(logFC>0) %>%
distinct(genes) %>%
dplyr::rename("Peakid"=genes)
DOX_3_DAR_up <- all_results %>%
dplyr::filter(source =="DOX_3") %>%
dplyr::filter(adj.P.Val<0.05) %>%
dplyr::filter(logFC>0) %>%
distinct(genes) %>%
dplyr::rename("Peakid"=genes)
DOX_24_DAR_down <- all_results %>%
dplyr::filter(source =="DOX_24") %>%
dplyr::filter(adj.P.Val<0.05) %>%
dplyr::filter(logFC<0) %>%
distinct(genes) %>%
dplyr::rename("Peakid"=genes)
DOX_3_DAR_down <- all_results %>%
dplyr::filter(source =="DOX_3") %>%
dplyr::filter(adj.P.Val<0.05) %>%
dplyr::filter(logFC<0) %>%
distinct(genes) %>%
dplyr::rename("Peakid"=genes)
### enrichment ratio default number
ER_rat <- 1.25
sample code for how data sets were produced for 20kb DAR-DEGs and DAR-nonDEGs:
DOX_24_DAR_DEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%DOX_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="exp_RNA")
DOX_24_DAR_nonDEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%DOX_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="not_exp_RNA")
EPI_24_DAR_DEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%EPI_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="exp_RNA")
EPI_24_DAR_nonDEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%EPI_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="not_exp_RNA")
DNR_24_DAR_DEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%DNR_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="exp_RNA")
DNR_24_DAR_nonDEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%DNR_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="not_exp_RNA")
MTX_24_DAR_DEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%MTX_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="exp_RNA")
MTX_24_DAR_nonDEG_20kb <-
annotated_regions %>%
dplyr::filter(Peakid %in%MTX_24_DAR$Peakid ) %>%
dplyr::filter(distanceToTSS >-20000 & distanceToTSS<20000) %>%
separate(Peakid, into = c("seqnames", "start", "end"), sep = "\\.", convert = TRUE, remove=FALSE) %>%
dplyr::filter(RNA_expressed=="not_exp_RNA")
df_names_DAR_DEGS <- ls(pattern = ".*_DAR_DEG_.*kb$")
df_names_DAR_nonDEGS <- ls(pattern = ".*_DAR_nonDEG_.*kb$")
df_list_DAR_DEGS <- mget(df_names_DAR_DEGS)
df_list_DAR_nonDEGS <- mget(df_names_DAR_nonDEGS)
# Step 3: Convert each dataframe to GRanges
gr_list_DAR_DEGS <- lapply(df_list_DAR_DEGS, function(df) {
GRanges(
seqnames = df$seqnames,
ranges = IRanges(start = df$start, end = df$end),
mcols = df %>% select(-seqnames, -start, -end)
)
})
gr_list_DAR_nonDEGS <- lapply(df_list_DAR_nonDEGS, function(df) {
GRanges(
seqnames = df$seqnames,
ranges = IRanges(start = df$start, end = df$end),
mcols = df %>% select(-seqnames, -start, -end)
)
})
# Optional: Name the GRanges list same as original data frames
names(gr_list_DAR_DEGS) <- df_names_DAR_DEGS
names(gr_list_DAR_nonDEGS) <- df_names_DAR_nonDEGS
# Folder with input BED files
output_dir <- "data/Final_four_data/re_analysis/motif_beds_centered"
# Create output folder if needed
dir.create(output_dir, showWarnings = FALSE)
# Loop through each BED file
for (name in names(gr_list_DAR_DEGS)) {
gr <- gr_list_DAR_DEGS[[name]]
# Recenter each region to 200 bp around its midpoint
gr_centered <- resize(gr, width = 200, fix = "center")
# Export to BED (auto converts to 0-based)
export(gr_centered, con = file.path(output_dir, paste0(name, "DAR_DEG_centered.bed")), format = "BED")
}
### not significant DAR regions for xstreme
for (name in names(gr_list_DAR_nonDEGS)) {
gr <- gr_list_DAR_nonDEGS[[name]]
# Recenter each region to 200 bp around its midpoint
gr_centered <- resize(gr, width = 200, fix = "center")
# Export to BED (auto converts to 0-based)
export(gr_centered, con = file.path(output_dir, paste0(name, "DAR_nonDEG_centered.bed")), format = "BED")
}
sample code for producing data from TACC
### replace MTX with whichever treatments you are running.
echo $(date) "Starting process"
ml tacc-apptainer
apptainer exec --bind `pwd`:/home/meme memesuite_latest.sif xstreme --oc MTX_24_20kb_xstreme/ --time 480 --streme-totallength 8000000 --meme-searchsize 100000 --desc description --dna --evt 0.05 --minw 6 --maxw 15 --align center --meme-mod zoops --m motif_databases/JASPAR/JASPAR2022_CORE_vertebrates_non-redundant_v2.meme --p fa_files/MTX_24_DEG_20kbDAR_DEG_centered.fa -n fa_files/MTX_24_nonDEG_20kbDAR_nonDEG_centered.fa
echo $(date) "MTX 24 20kb done"
DOX_24_20kb_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_24_20kb_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
DOX_24_20kb_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_24_20kb_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
DOX_24_20kb_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_24_20kb_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
DOX_24_20kb_sea_all <- rbind(DOX_24_20kb_sea_disc, DOX_24_20kb_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
DOX_24_20kb_combo <- DOX_24_20kb_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., DOX_24_20kb_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
DOX_24_20kb_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in DOX24 20kb ") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA1134.1 (FOS::JUNB) | STREME-1 | 1-RATGASTCATY | RATGASTCATY | FOS::JUNB | 0 | 0 | 6210 | 17.4 | 4170 | 8.15 | 2.14 |
2 | 1 | MA1144.1 | FOSL2::JUND | MA1144.1 | KATGACTCAT | FOSL2::JUND | 0 | 0 | 5840 | 16.4 | 3860 | 7.55 | 2.17 |
3 | 1 | MA0099.3 | FOS::JUN | MA0099.3 | ATGAGTCAYM | FOS::JUN | 0 | 0 | 6400 | 18 | 4450 | 8.7 | 2.07 |
4 | 1 | MA1138.1 | FOSL2::JUNB | MA1138.1 | KRTGASTCAT | FOSL2::JUNB | 0 | 0 | 5770 | 16.2 | 3800 | 7.44 | 2.18 |
5 | 1 | MA1134.1 | FOS::JUNB | MA1134.1 | KATGASTCATHN | FOS::JUNB | 0 | 0 | 5270 | 14.8 | 3300 | 6.45 | 2.29 |
6 | 1 | MA1130.1 | FOSL2::JUN | MA1130.1 | NNRTGAGTCAYN | FOSL2::JUN | 0 | 0 | 4970 | 14 | 3040 | 5.95 | 2.35 |
7 | 1 | MA0477.2 | FOSL1 | MA0477.2 | NNATGACTCATNN | FOSL1 | 0 | 0 | 4470 | 12.6 | 2560 | 5 | 2.51 |
8 | 1 | MA1633.2 | BACH1 | MA1633.2 | ATGACTCAT | BACH1 | 0 | 0 | 5540 | 15.6 | 3620 | 7.08 | 2.2 |
9 | 1 | MA0491.2 | JUND | MA0491.2 | NNATGACTCATNN | JUND | 0 | 0 | 5050 | 14.2 | 3140 | 6.14 | 2.31 |
10 | 1 | MA1141.1 | FOS::JUND | MA1141.1 | NKATGAGTCATNN | FOS::JUND | 0 | 0 | 6780 | 19.1 | 4960 | 9.68 | 1.97 |
11 | 1 | MA1128.1 | FOSL1::JUN | MA1128.1 | NKATGACTCATNN | FOSL1::JUN | 0 | 0 | 5990 | 16.8 | 4140 | 8.1 | 2.08 |
12 | 1 | MA0476.1 (FOS) | MEME-1 | ATGASTCA | ATGASTCA | FOS | 0 | 0 | 6870 | 19.3 | 5100 | 9.97 | 1.94 |
13 | 1 | MA0476.1 | FOS | MA0476.1 | DVTGASTCATB | FOS | 0 | 0 | 6610 | 18.6 | 4820 | 9.42 | 1.97 |
14 | 1 | MA1135.1 | FOSB::JUNB | MA1135.1 | KRTGASTCAT | FOSB::JUNB | 5.48906932529625e-321 | 5.48906932529625e-321 | 6930 | 19.5 | 5210 | 10.2 | 1.91 |
15 | 1 | MA0490.2 | JUNB | MA0490.2 | NNATGACTCATNN | JUNB | 2.22999987709968e-318 | 6.65012359302318e-320 | 5500 | 15.5 | 3710 | 7.24 | 2.14 |
16 | 1 | MA0489.2 | Jun | MA0489.2 | NVTGACTCATNN | Jun | 5.96999999997705e-313 | 1.680000000216e-314 | 6560 | 18.4 | 4860 | 9.5 | 1.94 |
17 | 1 | MA1988.1 | Atf3 | MA1988.1 | NRTGACTCABN | Atf3 | 2.35e-309 | 6.19000000000023e-311 | 5230 | 14.7 | 3480 | 6.8 | 2.16 |
18 | 1 | MA0655.1 | JDP2 | MA0655.1 | ATGACTCAT | JDP2 | 2.78e-308 | 6.92e-310 | 6060 | 17 | 4350 | 8.49 | 2 |
19 | 1 | MA1137.1 | FOSL1::JUNB | MA1137.1 | NDRTGACTCATNN | FOSL1::JUNB | 3.45e-304 | 8.14999999999999e-306 | 5700 | 16 | 3990 | 7.79 | 2.05 |
20 | 1 | MA0478.1 | FOSL2 | MA0478.1 | KRRTGASTCAB | FOSL2 | 3.72e-298 | 8.34e-300 | 7320 | 20.6 | 5800 | 11.3 | 1.82 |
21 | 1 | MA1142.1 | FOSL1::JUND | MA1142.1 | NRTGACTCAT | FOSL1::JUND | 1.49e-293 | 3.19e-295 | 5790 | 16.3 | 4150 | 8.1 | 2.01 |
22 | 1 | MA0835.2 | BATF3 | MA0835.2 | DATGACTCATH | BATF3 | 1.8e-292 | 3.67e-294 | 5750 | 16.2 | 4110 | 8.03 | 2.01 |
23 | 1 | MA1634.1 | BATF | MA1634.1 | DATGACTCATH | BATF | 1.2e-289 | 2.35e-291 | 6360 | 17.9 | 4790 | 9.36 | 1.91 |
24 | 1 | MA0841.1 | NFE2 | MA0841.1 | VATGACTCATS | NFE2 | 3.97e-286 | 7.43e-288 | 4590 | 12.9 | 2950 | 5.77 | 2.24 |
25 | 1 | MA1132.1 | JUN::JUNB | MA1132.1 | KATGACKCAT | JUN::JUNB | 5.85e-285 | 1.05e-286 | 4920 | 13.8 | 3290 | 6.43 | 2.15 |
26 | 1 | MA0462.2 | BATF::JUN | MA0462.2 | DATGACTCATH | BATF::JUN | 1.5e-283 | 2.59e-285 | 6730 | 18.9 | 5240 | 10.2 | 1.85 |
27 | 1 | MA1101.2 | BACH2 | MA1101.2 | DWANCATGASTCATSNTWH | BACH2 | 1.12e-281 | 1.86e-283 | 5990 | 16.8 | 4440 | 8.68 | 1.94 |
28 | 1 | MA1928.1 | BNC2 | MA1928.1 | NDRTGAGTCABN | BNC2 | 9.1e-230 | 1.46e-231 | 4010 | 11.3 | 2670 | 5.21 | 2.16 |
29 | 1 | MA0496.3 | MAFK | MA0496.3 | NGMTGACTCAGCMNH | MAFK | 5.7e-154 | 8.81e-156 | 8820 | 24.8 | 8870 | 17.3 | 1.43 |
30 | 1 | MA0089.2 | MAFG::NFE2L1 | MA0089.2 | NVNATGACTCAGCADW | MAFG::NFE2L1 | 4.46e-151 | 6.67e-153 | 4670 | 13.1 | 3910 | 7.64 | 1.72 |
31 | 1 | MA0501.1 | MAF::NFE2 | MA0501.1 | ATGACTCAGCANWTY | MAF::NFE2 | 1.59e-112 | 2.3e-114 | 13700 | 38.5 | 15900 | 31 | 1.24 |
32 | 2 | MA0018.4 (CREB1) | STREME-2 | 2-ATGACW | ATGACW | CREB1 | 3.05e-102 | 4.27e-104 | 26900 | 75.7 | 35300 | 69 | 1.1 |
33 | 1 | MA0659.3 | Mafg | MA0659.3 | NWGMTGACTCAGCAN | Mafg | 5.9e-78 | 8.02e-80 | 13800 | 38.8 | 16600 | 32.5 | 1.19 |
34 | 3 | MA0718.1 | RAX | MA0718.1 | RYYAATTARH | RAX | 5.22e-73 | 6.89e-75 | 29400 | 82.7 | 39700 | 77.6 | 1.07 |
35 | 4 | 3-RATAAG | STREME-3 | 3-RATAAG | RATAAG | RATAAG | 1.84e-72 | 2.36e-74 | 26800 | 75.4 | 35700 | 69.7 | 1.08 |
36 | 5 | MA0808.1 (TEAD3) | STREME-4 | 4-ATWCMA | ATWCMA | TEAD3 | 4.13e-71 | 5.15e-73 | 29600 | 83.3 | 40100 | 78.4 | 1.06 |
37 | 3 | MA0720.1 | Shox2 | MA0720.1 | YYAATTAR | Shox2 | 3.06e-70 | 3.72e-72 | 29100 | 81.7 | 39200 | 76.6 | 1.07 |
38 | 1 | MA0150.2 | Nfe2l2 | MA0150.2 | CASNATGACTCAGCA | Nfe2l2 | 6.91e-70 | 8.16e-72 | 20700 | 58.1 | 26600 | 52 | 1.12 |
39 | 3 | MA0725.1 | VSX1 | MA0725.1 | YTAATTAN | VSX1 | 4.64e-69 | 5.33e-71 | 31100 | 87.4 | 42500 | 83.1 | 1.05 |
40 | 3 | MA0027.2 | EN1 | MA0027.2 | SYAATTAV | EN1 | 9.31e-69 | 1.04e-70 | 29800 | 83.8 | 40400 | 79 | 1.06 |
41 | 1 | MA1639.1 | MEIS1 | MA1639.1 | NHCATMAATCAHN | MEIS1 | 1.61e-66 | 1.76e-68 | 20100 | 56.5 | 25800 | 50.4 | 1.12 |
42 | 3 | MA0674.1 | NKX6-1 | MA0674.1 | NTMATTAA | NKX6-1 | 5.16e-66 | 5.52e-68 | 31500 | 88.5 | 43200 | 84.3 | 1.05 |
43 | 3 | MA0710.1 | NOTO | MA0710.1 | NYTAATTAVN | NOTO | 3.38e-65 | 3.53e-67 | 27900 | 78.5 | 37500 | 73.4 | 1.07 |
44 | 3 | MA0726.1 | VSX2 | MA0726.1 | CTAATTAN | VSX2 | 1.79e-64 | 1.82e-66 | 29500 | 82.9 | 40000 | 78.1 | 1.06 |
45 | 1 | MA1113.2 | PBX2 | MA1113.2 | NHCATAAATCAHN | PBX2 | 2.17e-64 | 2.16e-66 | 27800 | 78.1 | 37300 | 72.9 | 1.07 |
46 | 3 | MA0704.1 | Lhx4 | MA0704.1 | YTAATTAR | Lhx4 | 3.74e-64 | 3.65e-66 | 26500 | 74.6 | 35400 | 69.2 | 1.08 |
47 | 6 | MA0035.4 | GATA1 | MA0035.4 | NHCTAATCTDH | GATA1 | 4.93e-64 | 4.71e-66 | 30000 | 84.3 | 40800 | 79.7 | 1.06 |
48 | 3 | MA0716.1 | PRRX1 | MA0716.1 | YYAATTAR | PRRX1 | 5.33e-63 | 4.98e-65 | 26500 | 74.5 | 35400 | 69.2 | 1.08 |
49 | 3 | MA0882.1 | DLX6 | MA0882.1 | VYAATTAN | DLX6 | 1.89e-62 | 1.71e-64 | 28000 | 78.8 | 37700 | 73.7 | 1.07 |
50 | 7 | MA0676.1 | Nr2e1 | MA0676.1 | AAAAGTCAA | Nr2e1 | 1.91e-62 | 1.71e-64 | 23700 | 66.6 | 31200 | 60.9 | 1.09 |
51 | 3 | MA0902.2 | HOXB2 | MA0902.2 | SYAATTAS | HOXB2 | 2.9e-62 | 2.55e-64 | 21800 | 61.2 | 28300 | 55.4 | 1.1 |
52 | 3 | MA0717.1 | RAX2 | MA0717.1 | YYAATTAV | RAX2 | 3.04e-61 | 2.63e-63 | 29100 | 81.9 | 39500 | 77.2 | 1.06 |
53 | 3 | MA1476.2 | Dlx5 | MA1476.2 | NDGCAATTAGNN | Dlx5 | 5.82e-61 | 4.93e-63 | 30400 | 85.5 | 41500 | 81.1 | 1.05 |
54 | 1 | MA1640.1 | MEIS2 | MA1640.1 | NNATGATTTATGDNH | MEIS2 | 7.84e-61 | 6.51e-63 | 26800 | 75.4 | 36000 | 70.3 | 1.07 |
55 | 3 | MA0630.1 | SHOX | MA0630.1 | HTAATTRR | SHOX | 9.06e-61 | 7.39e-63 | 27200 | 76.6 | 36600 | 71.5 | 1.07 |
56 | 3 | MA1963.1 | SATB1 | MA1963.1 | WWDCTAATAAHWW | SATB1 | 1.79e-60 | 1.43e-62 | 24300 | 68.2 | 32100 | 62.7 | 1.09 |
57 | 3 | MA0675.1 | NKX6-2 | MA0675.1 | NYMATTAA | NKX6-2 | 6.14e-60 | 4.84e-62 | 29000 | 81.4 | 39200 | 76.7 | 1.06 |
58 | 3 | MA0885.2 | Dlx2 | MA0885.2 | NDGCAATTAGNN | Dlx2 | 5.07e-58 | 3.92e-60 | 29800 | 83.9 | 40700 | 79.5 | 1.06 |
59 | 3 | MA0721.1 | UNCX | MA0721.1 | YYAATTAV | UNCX | 2.47e-57 | 1.88e-59 | 27100 | 76 | 36400 | 71.1 | 1.07 |
60 | 3 | MA0723.2 | VAX2 | MA0723.2 | BYAATTAV | VAX2 | 2.52e-57 | 1.89e-59 | 26600 | 74.9 | 35700 | 69.8 | 1.07 |
61 | 3 | MA0125.1 | Nobox | MA0125.1 | TAATTRSY | Nobox | 2.77e-57 | 2.03e-59 | 29900 | 84.1 | 40800 | 79.7 | 1.05 |
62 | 3 | MA0680.2 | Pax7 | MA0680.2 | DWTAATCAATTAWNH | Pax7 | 4.43e-57 | 3.21e-59 | 23600 | 66.3 | 31200 | 60.9 | 1.09 |
63 | 8 | MA0845.1 | FOXB1 | MA0845.1 | WATGTAAATAT | FOXB1 | 6.42e-56 | 4.57e-58 | 28200 | 79.4 | 38200 | 74.7 | 1.06 |
64 | 3 | MA0876.1 | BSX | MA0876.1 | SYVATTAV | BSX | 2.83e-55 | 1.98e-57 | 29400 | 82.6 | 40000 | 78.2 | 1.06 |
65 | 9 | MA1480.1 | DPRX | MA1480.1 | AGATAATCCN | DPRX | 3.06e-55 | 2.11e-57 | 29200 | 82.1 | 39800 | 77.7 | 1.06 |
66 | 3 | MA0628.1 | POU6F1 | MA0628.1 | AYTAATTART | POU6F1 | 7.53e-55 | 5.12e-57 | 27700 | 77.9 | 37400 | 73.2 | 1.06 |
67 | 10 | MA0602.1 | Arid5a | MA0602.1 | SYAATATTGVDANH | Arid5a | 8.17e-55 | 5.47e-57 | 27900 | 78.5 | 37700 | 73.7 | 1.06 |
68 | 3 | MA0886.1 | EMX2 | MA0886.1 | VBTAATTARB | EMX2 | 8.59e-55 | 5.67e-57 | 19900 | 55.9 | 25800 | 50.4 | 1.11 |
69 | 3 | MA0648.1 | GSC | MA0648.1 | VYTAATCCSY | GSC | 1.08e-54 | 7.04e-57 | 29700 | 83.4 | 40500 | 79.1 | 1.05 |
70 | 3 | MA0713.1 | PHOX2A | MA0713.1 | TAATYYAATTA | PHOX2A | 2.23e-54 | 1.43e-56 | 23900 | 67.1 | 31600 | 61.8 | 1.09 |
71 | 11 | MA0786.1 | POU3F1 | MA0786.1 | WTATGCWAATKW | POU3F1 | 3.33e-54 | 2.11e-56 | 27300 | 76.7 | 36800 | 71.9 | 1.07 |
72 | 3 | MA1495.1 | HOXA1 | MA1495.1 | STAATTAS | HOXA1 | 4.87e-54 | 3.03e-56 | 16900 | 47.4 | 21500 | 41.9 | 1.13 |
73 | 11 | MA0142.1 | Pou5f1::Sox2 | MA0142.1 | CWTTGTYATGCAAAT | Pou5f1::Sox2 | 1.31e-53 | 8.04e-56 | 27300 | 76.7 | 36800 | 71.9 | 1.07 |
74 | 3 | MA0880.1 | Dlx3 | MA0880.1 | NYAATTAN | Dlx3 | 1.44e-53 | 8.72e-56 | 23500 | 66 | 31100 | 60.8 | 1.09 |
75 | 3 | MA0890.1 | GBX2 | MA0890.1 | RSYAATTARN | GBX2 | 2.17e-53 | 1.3e-55 | 25300 | 71 | 33800 | 66 | 1.08 |
76 | 12 | MA0790.1 | POU4F1 | MA0790.1 | ATGMATAATTAATG | POU4F1 | 6.01e-53 | 3.55e-55 | 24600 | 69.1 | 32700 | 64 | 1.08 |
77 | 12 | MA0791.1 | POU4F3 | MA0791.1 | RTGMATWATTAATGAR | POU4F3 | 6.95e-53 | 4.05e-55 | 21600 | 60.7 | 28300 | 55.3 | 1.1 |
78 | 3 | MA0635.1 | BARHL2 | MA0635.1 | VHTAAAYGRT | BARHL2 | 1.46e-52 | 8.38e-55 | 30000 | 84.2 | 41000 | 80.1 | 1.05 |
79 | 6 | MA1104.2 | GATA6 | MA1104.2 | HWWTCTTATCTNH | GATA6 | 2.78e-52 | 1.58e-54 | 21000 | 59 | 27500 | 53.7 | 1.1 |
80 | 3 | MA0700.2 | LHX2 | MA0700.2 | DDCCAATTADN | LHX2 | 1.26e-51 | 7.08e-54 | 26200 | 73.8 | 35300 | 68.9 | 1.07 |
81 | 8 | MA0041.2 | FOXD3 | MA0041.2 | WAWGHAAATAAACARY | FOXD3 | 1.6e-51 | 8.85e-54 | 27400 | 77 | 37000 | 72.3 | 1.06 |
82 | 3 | MA0708.2 | MSX2 | MA0708.2 | SCAATTAV | MSX2 | 3.42e-51 | 1.87e-53 | 28200 | 79.3 | 38300 | 74.8 | 1.06 |
83 | 3 | MA0881.1 | Dlx4 | MA0881.1 | VYAATTAN | Dlx4 | 4.9e-51 | 2.65e-53 | 22600 | 63.6 | 29900 | 58.4 | 1.09 |
84 | 3 | MA0658.1 | LHX6 | MA0658.1 | RCTAATTARY | LHX6 | 7.43e-51 | 3.97e-53 | 24400 | 68.6 | 32500 | 63.5 | 1.08 |
85 | 3 | MA0899.1 | HOXA10 | MA0899.1 | DGYMATAAAAH | HOXA10 | 1.76e-50 | 9.27e-53 | 29500 | 82.8 | 40200 | 78.6 | 1.05 |
86 | 3 | MA0709.1 | Msx3 | MA0709.1 | SCAATTAN | Msx3 | 1.99e-50 | 1.04e-52 | 21900 | 61.7 | 28900 | 56.5 | 1.09 |
87 | 11 | MA0792.1 | POU5F1B | MA0792.1 | TATGCWAAT | POU5F1B | 1.09e-49 | 5.6e-52 | 28100 | 78.8 | 38100 | 74.4 | 1.06 |
88 | 13 | MA0901.2 | HOXB13 | MA0901.2 | WDCCAATAAAAWWW | HOXB13 | 1.28e-49 | 6.55e-52 | 25100 | 70.4 | 33500 | 65.5 | 1.07 |
89 | 3 | MA0650.3 | Hoxa13 | MA0650.3 | NNCCAATAAAAN | Hoxa13 | 2.4e-49 | 1.21e-51 | 29000 | 81.4 | 39500 | 77.2 | 1.06 |
90 | 8 | MA0032.2 | FOXC1 | MA0032.2 | WAWGTAAAYAW | FOXC1 | 3.74e-49 | 1.87e-51 | 28400 | 80 | 38700 | 75.6 | 1.06 |
91 | 3 | MA1502.1 | HOXB8 | MA1502.1 | GYMATTAM | HOXB8 | 4.72e-49 | 2.33e-51 | 19300 | 54.3 | 25100 | 49.1 | 1.11 |
92 | 3 | MA0068.2 | PAX4 | MA0068.2 | CTAATTAG | PAX4 | 6.52e-49 | 3.18e-51 | 30800 | 86.5 | 42300 | 82.7 | 1.05 |
93 | 3 | MA0891.1 | GSC2 | MA0891.1 | NYTAATCCBH | GSC2 | 9.44e-49 | 4.56e-51 | 25500 | 71.8 | 34300 | 67 | 1.07 |
94 | 3 | MA0122.3 | Nkx3-2 | MA0122.3 | WWAAMCACTTAAN | Nkx3-2 | 1.69e-48 | 8.04e-51 | 28700 | 80.7 | 39100 | 76.4 | 1.06 |
95 | 3 | MA0910.2 | HOXD8 | MA0910.2 | GYMATTAM | HOXD8 | 2.07e-48 | 9.78e-51 | 19400 | 54.4 | 25200 | 49.2 | 1.11 |
96 | 3 | MA0707.2 | MNX1 | MA0707.2 | VYRATTAK | MNX1 | 4.94e-48 | 2.31e-50 | 18900 | 53.2 | 24600 | 48 | 1.11 |
97 | 3 | MA1549.1 | POU6F1 | MA1549.1 | RTAATGAGVN | POU6F1 | 1.22e-47 | 5.63e-50 | 19100 | 53.6 | 24800 | 48.4 | 1.11 |
98 | 11 | MA0785.1 | POU2F1 | MA0785.1 | AWTATGCWAATK | POU2F1 | 1.39e-47 | 6.36e-50 | 28100 | 79.1 | 38200 | 74.7 | 1.06 |
99 | 3 | MA0661.1 | MEOX1 | MA0661.1 | VSTAATTAHC | MEOX1 | 3.19e-47 | 1.45e-49 | 27300 | 76.8 | 37000 | 72.4 | 1.06 |
100 | 11 | MA0788.1 | POU3F3 | MA0788.1 | WWTATGCWAATTW | POU3F3 | 4.65e-47 | 2.09e-49 | 25800 | 72.5 | 34700 | 67.8 | 1.07 |
101 | 3 | MA0903.1 | HOXB3 | MA0903.1 | NNYMATTARN | HOXB3 | 6.64e-47 | 2.95e-49 | 27500 | 77.3 | 37300 | 72.9 | 1.06 |
102 | 3 | MA1498.2 | HOXA7 | MA1498.2 | SYMATTAV | HOXA7 | 1.39e-46 | 6.13e-49 | 26500 | 74.6 | 35800 | 70 | 1.07 |
103 | 6 | MA0036.3 | GATA2 | MA0036.3 | NBCTTATCTNH | GATA2 | 1.58e-46 | 6.89e-49 | 16600 | 46.8 | 21400 | 41.7 | 1.12 |
104 | 11 | MA0627.2 | POU2F3 | MA0627.2 | NWTATGCAAATNH | POU2F3 | 2.44e-46 | 1.05e-48 | 26200 | 73.6 | 35300 | 69 | 1.07 |
105 | 14 | 7-ABTGAG | STREME-7 | 7-ABTGAG | ABTGAG | ABTGAG | 5.21e-46 | 2.23e-48 | 33600 | 94.5 | 47000 | 91.9 | 1.03 |
106 | 15 | MA1623.1 (Stat2) | STREME-6 | 6-AACAGA | AACAGA | Stat2 | 5.66e-46 | 2.39e-48 | 32500 | 91.2 | 45100 | 88.1 | 1.04 |
107 | 6 | MA0712.2 | OTX2 | MA0712.2 | NDRGGATTARNN | OTX2 | 6.1e-46 | 2.56e-48 | 30800 | 86.6 | 42500 | 83 | 1.04 |
108 | 3 | MA0654.1 | ISX | MA0654.1 | YYAATTAR | ISX | 1.78e-45 | 7.38e-48 | 19700 | 55.3 | 25700 | 50.3 | 1.1 |
109 | 3 | MA0662.1 | MIXL1 | MA0662.1 | NBYAATTAVN | MIXL1 | 2.1e-45 | 8.65e-48 | 25100 | 70.6 | 33700 | 65.9 | 1.07 |
110 | 3 | MA0681.2 | PHOX2B | MA0681.2 | WWTAATCAAATTAWWW | PHOX2B | 3.8e-45 | 1.55e-47 | 20600 | 58 | 27100 | 53 | 1.09 |
111 | 3 | MA0887.1 | EVX1 | MA0887.1 | VVTAATTABS | EVX1 | 4.08e-45 | 1.65e-47 | 25900 | 72.7 | 34900 | 68.1 | 1.07 |
112 | 16 | MA1978.1 | ZNF354A | MA1978.1 | HWAADWATAAATGRAYWAWTT | ZNF354A | 5.1e-45 | 2.04e-47 | 26500 | 74.6 | 35900 | 70.1 | 1.06 |
113 | 17 | MA0143.4 | SOX2 | MA0143.4 | DNACAATGGNN | SOX2 | 6.51e-45 | 2.57e-47 | 30200 | 85 | 41600 | 81.2 | 1.05 |
114 | 18 | MA0495.3 | MAFF | MA0495.3 | DWGTCAGCATTTTWHN | MAFF | 6.54e-45 | 2.57e-47 | 29100 | 81.8 | 39800 | 77.7 | 1.05 |
115 | 9 | MA0714.1 | PITX3 | MA0714.1 | NYTAATCCC | PITX3 | 1.29e-44 | 5.03e-47 | 33100 | 93 | 46200 | 90.3 | 1.03 |
116 | 8 | MA0846.1 | FOXC2 | MA0846.1 | WAWGTAAACAWW | FOXC2 | 2.37e-44 | 9.15e-47 | 29800 | 83.8 | 40900 | 79.9 | 1.05 |
117 | 19 | 5-TCACAA | STREME-5 | 5-TCACAA | TCACAA | TCACAA | 2.59e-44 | 9.93e-47 | 31400 | 88.2 | 43400 | 84.8 | 1.04 |
118 | 3 | MA0706.1 | MEOX2 | MA0706.1 | DSTAATTAWN | MEOX2 | 5.09e-44 | 1.94e-46 | 19000 | 53.4 | 24800 | 48.5 | 1.1 |
119 | 3 | MA0158.2 | HOXA5 | MA0158.2 | GYMATTAS | HOXA5 | 9.12e-44 | 3.44e-46 | 13200 | 37.1 | 16600 | 32.4 | 1.14 |
120 | 3 | MA1519.1 | LHX5 | MA1519.1 | STAATTAV | LHX5 | 1.17e-43 | 4.37e-46 | 14400 | 40.5 | 18300 | 35.7 | 1.13 |
121 | 20 | MA0084.1 | SRY | MA0084.1 | NWWAACAAT | SRY | 2.08e-43 | 7.7e-46 | 25600 | 71.8 | 34400 | 67.3 | 1.07 |
122 | 3 | MA0793.1 | POU6F2 | MA0793.1 | ASCTMATTAW | POU6F2 | 2.48e-43 | 9.1e-46 | 16600 | 46.6 | 21400 | 41.8 | 1.12 |
123 | 6 | MA1970.1 | TRPS1 | MA1970.1 | NHTCTTATCTNH | TRPS1 | 3.41e-43 | 1.24e-45 | 18800 | 52.8 | 24500 | 47.9 | 1.1 |
124 | 21 | MA0052.4 | MEF2A | MA0052.4 | DDCTAAAAATAGMHH | MEF2A | 5.04e-43 | 1.82e-45 | 22600 | 63.6 | 30100 | 58.8 | 1.08 |
125 | 3 | MA1518.2 | Lhx1 | MA1518.2 | NNNTGCTAATTAGCANNN | Lhx1 | 6.24e-43 | 2.24e-45 | 26700 | 75.2 | 36200 | 70.8 | 1.06 |
126 | 3 | MA0699.1 | LBX2 | MA0699.1 | NCYAATTARN | LBX2 | 6.84e-43 | 2.43e-45 | 26100 | 73.3 | 35200 | 68.8 | 1.06 |
127 | 3 | MA0722.1 | VAX1 | MA0722.1 | YTAATTAH | VAX1 | 2.34e-42 | 8.26e-45 | 19100 | 53.6 | 25000 | 48.8 | 1.1 |
128 | 9 | MA0711.1 | OTX1 | MA0711.1 | HTAATCCS | OTX1 | 2.99e-42 | 1.05e-44 | 27700 | 77.9 | 37700 | 73.7 | 1.06 |
129 | 11 | MA0789.1 | POU3F4 | MA0789.1 | TATGCWAAT | POU3F4 | 4.92e-42 | 1.71e-44 | 24000 | 67.4 | 32100 | 62.8 | 1.07 |
130 | 6 | MA0766.2 | GATA5 | MA0766.2 | SWGATAASRN | GATA5 | 1.25e-41 | 4.32e-44 | 13900 | 39 | 17600 | 34.4 | 1.13 |
131 | 3 | MA1481.1 | DRGX | MA1481.1 | YYAATTAN | DRGX | 1.29e-41 | 4.4e-44 | 19600 | 55 | 25700 | 50.2 | 1.1 |
132 | 13 | MA0465.2 | CDX2 | MA0465.2 | NDGCAATAAAWN | CDX2 | 1.69e-41 | 5.74e-44 | 25700 | 72.2 | 34700 | 67.8 | 1.07 |
133 | 3 | MA0904.2 | HOXB5 | MA0904.2 | STAATTAS | HOXB5 | 1.94e-41 | 6.55e-44 | 12200 | 34.2 | 15200 | 29.7 | 1.15 |
134 | 3 | MA0888.1 | EVX2 | MA0888.1 | VVTAATTAVB | EVX2 | 2.18e-41 | 7.28e-44 | 30000 | 84.3 | 41300 | 80.6 | 1.05 |
135 | 3 | MA0666.2 | MSX1 | MA0666.2 | SCAATTAR | MSX1 | 4.13e-41 | 1.37e-43 | 23000 | 64.6 | 30700 | 59.9 | 1.08 |
136 | 3 | MA1471.1 | BARX2 | MA1471.1 | NWWAAYMATTAN | BARX2 | 4.18e-41 | 1.38e-43 | 26300 | 73.9 | 35600 | 69.6 | 1.06 |
137 | 3 | MA0875.1 | BARX1 | MA0875.1 | VCMATTAV | BARX1 | 5.41e-41 | 1.77e-43 | 27300 | 76.8 | 37200 | 72.7 | 1.06 |
138 | 3 | MA0634.1 | ALX3 | MA0634.1 | YYYAATTANV | ALX3 | 1.21e-40 | 3.95e-43 | 23200 | 65.3 | 31100 | 60.7 | 1.08 |
139 | 3 | MA0892.1 | GSX1 | MA0892.1 | NCYMATTAVN | GSX1 | 1.78e-40 | 5.75e-43 | 27300 | 76.7 | 37100 | 72.5 | 1.06 |
140 | 8 | MA0148.4 | FOXA1 | MA0148.4 | WWGTAAACATNN | FOXA1 | 3.71e-40 | 1.19e-42 | 30700 | 86.2 | 42400 | 82.8 | 1.04 |
141 | 8 | MA1607.1 | Foxl2 | MA1607.1 | DDWATGTAAACANN | Foxl2 | 4.63e-40 | 1.47e-42 | 30900 | 86.8 | 42700 | 83.4 | 1.04 |
142 | 17 | MA1152.1 | SOX15 | MA1152.1 | CYWTTGTTHW | SOX15 | 1.95e-39 | 6.17e-42 | 27300 | 76.8 | 37200 | 72.7 | 1.06 |
143 | 6 | MA0037.4 | Gata3 | MA0037.4 | NHTCTTATCTNH | Gata3 | 2.25e-38 | 7.06e-41 | 18700 | 52.5 | 24500 | 47.9 | 1.1 |
144 | 3 | MA0879.2 | DLX1 | MA0879.2 | VYAATTAN | DLX1 | 2.71e-38 | 8.45e-41 | 22900 | 64.5 | 30700 | 60 | 1.07 |
145 | 3 | MA0705.1 | Lhx8 | MA0705.1 | CTAATTAV | Lhx8 | 3.98e-38 | 1.23e-40 | 17400 | 49 | 22700 | 44.4 | 1.1 |
146 | 3 | MA1505.1 | HOXC8 | MA1505.1 | KTAATTAM | HOXC8 | 5.24e-38 | 1.61e-40 | 17600 | 49.5 | 23000 | 44.9 | 1.1 |
147 | 3 | MA1530.1 | NKX6-3 | MA1530.1 | VYCATTAWW | NKX6-3 | 9.39e-38 | 2.87e-40 | 21100 | 59.3 | 28000 | 54.7 | 1.08 |
148 | 17 | MA0077.1 | SOX9 | MA0077.1 | CYATTGTTY | SOX9 | 2.63e-37 | 7.97e-40 | 32600 | 91.7 | 45600 | 89 | 1.03 |
149 | 8 | MA1606.1 | Foxf1 | MA1606.1 | DWRTAAACANN | Foxf1 | 5.79e-37 | 1.74e-39 | 28800 | 80.9 | 39500 | 77.2 | 1.05 |
150 | 22 | MA0769.2 | TCF7 | MA0769.2 | NNCTTTGAWVT | TCF7 | 1.35e-36 | 4.04e-39 | 29500 | 83 | 40600 | 79.4 | 1.04 |
151 | 3 | MA0701.2 | LHX9 | MA0701.2 | CYAATTAR | LHX9 | 1.52e-36 | 4.53e-39 | 13300 | 37.4 | 16900 | 33.1 | 1.13 |
152 | 22 | MA0523.1 | TCF7L2 | MA0523.1 | DRASATCAAAGRVA | TCF7L2 | 6.39e-36 | 1.89e-38 | 20600 | 58 | 27400 | 53.6 | 1.08 |
153 | 3 | MA0889.1 | GBX1 | MA0889.1 | VCYAATTASY | GBX1 | 2.34e-35 | 6.85e-38 | 19100 | 53.8 | 25200 | 49.3 | 1.09 |
154 | 3 | MA0612.2 | EMX1 | MA0612.2 | BTAATTAS | EMX1 | 2.44e-35 | 7.1e-38 | 14400 | 40.4 | 18400 | 36 | 1.12 |
155 | 11 | MA1115.1 | POU5F1 | MA1115.1 | WHATGCAAATN | POU5F1 | 8.59e-35 | 2.47e-37 | 29600 | 83.3 | 40900 | 79.9 | 1.04 |
156 | 23 | MA0809.2 | TEAD4 | MA0809.2 | NNACATTCCARN | TEAD4 | 8.59e-35 | 2.47e-37 | 32100 | 90.2 | 44800 | 87.4 | 1.03 |
157 | 8 | MA1103.2 | FOXK2 | MA1103.2 | DDGTAAACANN | FOXK2 | 1.08e-34 | 3.08e-37 | 31100 | 87.6 | 43200 | 84.5 | 1.04 |
158 | 3 | MA0894.1 | HESX1 | MA0894.1 | GHTAATTRRH | HESX1 | 1.09e-34 | 3.1e-37 | 20000 | 56.2 | 26500 | 51.8 | 1.08 |
159 | 8 | MA0848.1 | FOXO4 | MA0848.1 | GTAAACA | FOXO4 | 1.15e-34 | 3.26e-37 | 29100 | 81.9 | 40100 | 78.3 | 1.04 |
160 | 24 | MA0108.2 | TBP | MA0108.2 | STATAAAWRSVVVBN | TBP | 1.18e-34 | 3.31e-37 | 28900 | 81.3 | 39800 | 77.7 | 1.05 |
161 | 3 | MA0644.2 | ESX1 | MA0644.2 | YYAATTAN | ESX1 | 1.36e-34 | 3.8e-37 | 16100 | 45.3 | 20900 | 40.9 | 1.11 |
162 | 3 | MA0075.3 | PRRX2 | MA0075.3 | CTAATTAN | PRRX2 | 1.7e-34 | 4.7e-37 | 14800 | 41.6 | 19100 | 37.3 | 1.12 |
163 | 17 | MA0868.2 | SOX8 | MA0868.2 | MGAACAATRG | SOX8 | 2.67e-34 | 7.34e-37 | 30100 | 84.6 | 41600 | 81.3 | 1.04 |
164 | 9 | MA0682.2 | PITX1 | MA0682.2 | YTAATCCH | PITX1 | 2.76e-34 | 7.55e-37 | 18900 | 53.1 | 24900 | 48.7 | 1.09 |
165 | 17 | MA1120.1 | SOX13 | MA1120.1 | DVACAATGGNN | SOX13 | 3.81e-34 | 1.03e-36 | 29600 | 83.3 | 40900 | 79.9 | 1.04 |
166 | 8 | MA0042.2 | FOXI1 | MA0042.2 | GTAAACA | FOXI1 | 3.83e-34 | 1.04e-36 | 26200 | 73.6 | 35600 | 69.6 | 1.06 |
167 | 3 | MA0642.2 | EN2 | MA0642.2 | VSYAATTA | EN2 | 5.03e-34 | 1.35e-36 | 25400 | 71.3 | 34400 | 67.3 | 1.06 |
168 | 17 | MA1563.2 | SOX18 | MA1563.2 | AACAATDV | SOX18 | 6.49e-34 | 1.73e-36 | 26700 | 75.1 | 36400 | 71.2 | 1.05 |
169 | 8 | MA1683.1 | FOXA3 | MA1683.1 | DWGTAAACANN | FOXA3 | 6.74e-34 | 1.79e-36 | 32000 | 89.9 | 44600 | 87.1 | 1.03 |
170 | 3 | MA1497.1 | HOXA6 | MA1497.1 | BGYMATTANN | HOXA6 | 1.41e-33 | 3.73e-36 | 27100 | 76.1 | 37000 | 72.3 | 1.05 |
171 | 8 | MA0047.3 | FOXA2 | MA0047.3 | RWGTAAACANN | FOXA2 | 3.34e-33 | 8.77e-36 | 28800 | 80.8 | 39600 | 77.3 | 1.05 |
172 | 3 | MA0912.2 | HOXD3 | MA0912.2 | CTAATTAC | HOXD3 | 3.92e-33 | 1.02e-35 | 10400 | 29.1 | 12900 | 25.3 | 1.15 |
173 | 12 | MA0046.2 | HNF1A | MA0046.2 | DRTTAATNATTAACN | HNF1A | 1.61e-32 | 4.17e-35 | 15600 | 44 | 20300 | 39.7 | 1.11 |
174 | 8 | MA0481.3 | FOXP1 | MA0481.3 | NDGTAAACANH | FOXP1 | 1.63e-32 | 4.21e-35 | 28900 | 81.2 | 39800 | 77.7 | 1.04 |
175 | 3 | MA0914.1 | ISL2 | MA0914.1 | GCAMTTAR | ISL2 | 2.14e-32 | 5.49e-35 | 29000 | 81.4 | 39900 | 78 | 1.04 |
176 | 12 | MA0153.2 | HNF1B | MA0153.2 | GTTAATNATTAAY | HNF1B | 3.14e-32 | 8.01e-35 | 14800 | 41.6 | 19200 | 37.5 | 1.11 |
177 | 25 | MA0679.2 | ONECUT1 | MA0679.2 | WAAAAATCAATAAWWN | ONECUT1 | 6.31e-32 | 1.6e-34 | 18800 | 52.8 | 24900 | 48.6 | 1.09 |
178 | 8 | MA0033.2 | FOXL1 | MA0033.2 | RTAAACA | FOXL1 | 1.05e-31 | 2.64e-34 | 29900 | 84 | 41300 | 80.8 | 1.04 |
179 | 3 | MA0893.2 | GSX2 | MA0893.2 | SYMATTAR | GSX2 | 1.53e-31 | 3.84e-34 | 29200 | 82.1 | 40300 | 78.8 | 1.04 |
180 | 3 | MA0900.2 | HOXA2 | MA0900.2 | STAATTAS | HOXA2 | 2.32e-31 | 5.79e-34 | 10600 | 29.9 | 13400 | 26.1 | 1.14 |
181 | 3 | MA0896.1 | Hmx1 | MA0896.1 | VSVAGCAATTAAWGVDB | Hmx1 | 3.92e-31 | 9.71e-34 | 18900 | 53.2 | 25100 | 49 | 1.09 |
182 | 25 | MA0755.1 | CUX2 | MA0755.1 | TRATCGATAW | CUX2 | 4.15e-31 | 1.02e-33 | 22000 | 62 | 29600 | 57.9 | 1.07 |
183 | 1 | MA0591.1 | Bach1::Mafk | MA0591.1 | RSSATGACTCAGCAB | Bach1::Mafk | 6.63e-31 | 1.63e-33 | 1310 | 3.69 | 1170 | 2.29 | 1.61 |
184 | 18 | MA0117.2 | Mafb | MA0117.2 | AWWNTGCTGACD | Mafb | 7.55e-31 | 1.84e-33 | 28100 | 79 | 38600 | 75.4 | 1.05 |
185 | 3 | MA0124.2 | Nkx3-1 | MA0124.2 | RCCACTTAA | Nkx3-1 | 1.73e-30 | 4.2e-33 | 29800 | 83.8 | 41300 | 80.7 | 1.04 |
186 | 26 | MA0018.4 | CREB1 | MA0018.4 | NNRTGATGTCAYN | CREB1 | 1.95e-30 | 4.7e-33 | 26000 | 73.2 | 35500 | 69.4 | 1.05 |
187 | 6 | MA0482.2 | GATA4 | MA0482.2 | NNCCTTATCTNH | GATA4 | 2.63e-30 | 6.32e-33 | 13400 | 37.7 | 17300 | 33.8 | 1.12 |
188 | 17 | MA0514.2 | Sox3 | MA0514.2 | DNACAATGGNN | Sox3 | 9.88e-30 | 2.36e-32 | 32600 | 91.8 | 45700 | 89.4 | 1.03 |
189 | 27 | MA0836.2 | CEBPD | MA0836.2 | NRTTGCACAAYHN | CEBPD | 2.47e-29 | 5.86e-32 | 28000 | 78.7 | 38500 | 75.3 | 1.05 |
190 | 28 | MA0905.1 | HOXC10 | MA0905.1 | GTCRTAAAAH | HOXC10 | 4.84e-29 | 1.14e-31 | 22300 | 62.6 | 30000 | 58.6 | 1.07 |
191 | 3 | MA0898.1 | Hmx3 | MA0898.1 | ASAAGCAATTAAWRRAT | Hmx3 | 9.72e-29 | 2.28e-31 | 18200 | 51.2 | 24100 | 47.2 | 1.09 |
192 | 26 | MA1143.1 | FOSL1::JUND | MA1143.1 | RTGACGTMAY | FOSL1::JUND | 2.67e-28 | 6.23e-31 | 23600 | 66.3 | 32000 | 62.4 | 1.06 |
193 | 3 | MA1463.1 | ARGFX | MA1463.1 | DCTAATTARM | ARGFX | 5.75e-28 | 1.34e-30 | 14500 | 40.7 | 18800 | 36.8 | 1.11 |
194 | 29 | MA0502.2 (NFYB) | STREME-9 | 9-CCAATG | CCAATG | NFYB | 9.09e-28 | 2.1e-30 | 34300 | 96.3 | 48500 | 94.7 | 1.02 |
195 | 3 | MA0895.1 | HMBOX1 | MA0895.1 | MYTAGTTAMS | HMBOX1 | 1.54e-27 | 3.55e-30 | 24700 | 69.5 | 33700 | 65.8 | 1.06 |
196 | 30 | MA1489.1 | FOXN3 | MA1489.1 | GTAAACAA | FOXN3 | 1.81e-27 | 4.15e-30 | 24200 | 68 | 32900 | 64.3 | 1.06 |
197 | 8 | MA0850.1 | FOXP3 | MA0850.1 | RTAAACA | FOXP3 | 5.82e-27 | 1.32e-29 | 29300 | 82.4 | 40600 | 79.3 | 1.04 |
198 | 17 | MA1603.1 | Dmrt1 | MA1603.1 | GHTACAAAGTADC | Dmrt1 | 5.87e-27 | 1.33e-29 | 26000 | 73 | 35500 | 69.4 | 1.05 |
199 | 3 | MA1500.1 | HOXB6 | MA1500.1 | BTAATKRC | HOXB6 | 1.16e-26 | 2.62e-29 | 22200 | 62.3 | 30000 | 58.6 | 1.06 |
200 | 31 | MA1623.1 | Stat2 | MA1623.1 | RGAAACAGAAASH | Stat2 | 1.94e-26 | 4.36e-29 | 31300 | 87.9 | 43700 | 85.3 | 1.03 |
201 | 17 | MA0078.2 | Sox17 | MA0078.2 | NNAGAACAATGGNN | Sox17 | 2.03e-26 | 4.52e-29 | 27300 | 76.7 | 37500 | 73.3 | 1.05 |
202 | 32 | MA1657.1 | ZNF652 | MA1657.1 | BAAAGRGTTAAN | ZNF652 | 2.06e-26 | 4.57e-29 | 21500 | 60.5 | 29000 | 56.7 | 1.07 |
203 | 22 | MA0768.2 | Lef1 | MA0768.2 | NNBCCTTTGATSTN | Lef1 | 6.1e-26 | 1.35e-28 | 30300 | 85.1 | 42100 | 82.3 | 1.03 |
204 | 3 | MA1608.1 | Isl1 | MA1608.1 | NNCCATTAGNN | Isl1 | 6.58e-26 | 1.45e-28 | 22700 | 63.8 | 30700 | 60 | 1.06 |
205 | 33 | MA1562.1 | SOX14 | MA1562.1 | CCGAACAATG | SOX14 | 1.6e-25 | 3.51e-28 | 27500 | 77.4 | 37900 | 74.1 | 1.04 |
206 | 31 | MA0050.3 | Irf1 | MA0050.3 | ANTGAAACTGAAASH | Irf1 | 2.95e-25 | 6.42e-28 | 30500 | 85.7 | 42400 | 82.9 | 1.03 |
207 | 3 | MA0853.1 | Alx4 | MA0853.1 | CGCRYTAATTARYHNNY | Alx4 | 5.29e-25 | 1.15e-27 | 15200 | 42.7 | 19900 | 39 | 1.09 |
208 | 8 | MA0852.2 | FOXK1 | MA0852.2 | NRDGTAAACAAGNN | FOXK1 | 1.02e-24 | 2.21e-27 | 32300 | 90.9 | 45300 | 88.6 | 1.03 |
209 | 34 | MA0611.2 | Dux | MA0611.2 | NTGATTBAATCAVWNN | Dux | 1.09e-24 | 2.34e-27 | 11300 | 31.8 | 14500 | 28.4 | 1.12 |
210 | 3 | MA0467.2 | Crx | MA0467.2 | NDGGATTANN | Crx | 2e-24 | 4.27e-27 | 22200 | 62.3 | 30000 | 58.7 | 1.06 |
211 | 35 | MA0774.1 | MEIS2 | MA0774.1 | TTGACAGS | MEIS2 | 2.98e-24 | 6.34e-27 | 33300 | 93.6 | 46900 | 91.6 | 1.02 |
212 | 36 | MA1125.1 (ZNF384) | MEME-2 | TTTTTTTTTTTTTTT | TTTTTTTTTTTTTTT | ZNF384 | 3.09e-24 | 6.54e-27 | 22900 | 64.4 | 31100 | 60.8 | 1.06 |
213 | 3 | MA0724.1 | VENTX | MA0724.1 | AMCGATTAR | VENTX | 3.46e-24 | 7.29e-27 | 14500 | 40.7 | 19000 | 37.1 | 1.1 |
214 | 13 | MA0913.2 | HOXD9 | MA0913.2 | GYMATAAAAH | HOXD9 | 3.91e-24 | 8.2e-27 | 19100 | 53.8 | 25600 | 50.1 | 1.07 |
215 | 3 | MA1504.1 | HOXC4 | MA1504.1 | VTMATTAN | HOXC4 | 5.32e-24 | 1.11e-26 | 21600 | 60.6 | 29100 | 57 | 1.06 |
216 | 37 | MA0606.2 | Nfat5 | MA0606.2 | NNATGGAAAAWN | Nfat5 | 6.69e-24 | 1.39e-26 | 29600 | 83.2 | 41100 | 80.4 | 1.04 |
217 | 3 | MA0897.1 | Hmx2 | MA0897.1 | ASAAGCAATTAAHVVRT | Hmx2 | 7.77e-24 | 1.61e-26 | 19200 | 54.1 | 25800 | 50.4 | 1.07 |
218 | 3 | MA1496.1 | HOXA4 | MA1496.1 | RTMATTAV | HOXA4 | 2.98e-23 | 6.13e-26 | 24300 | 68.2 | 33100 | 64.8 | 1.05 |
219 | 38 | MA0151.1 | Arid3a | MA0151.1 | ATYAAA | Arid3a | 3.59e-23 | 7.35e-26 | 15600 | 43.7 | 20600 | 40.2 | 1.09 |
220 | 9 | MA0883.1 | Dmbx1 | MA0883.1 | WDRWNMGGATTADKNNW | Dmbx1 | 4.12e-23 | 8.4e-26 | 17300 | 48.5 | 23000 | 44.9 | 1.08 |
221 | 3 | MA0909.3 | Hoxd13 | MA0909.3 | NNCAATAAAW | Hoxd13 | 4.21e-23 | 8.55e-26 | 17600 | 49.6 | 23500 | 45.9 | 1.08 |
222 | 26 | MA1112.2 | NR4A1 | MA1112.2 | NNAAAGGTCANN | NR4A1 | 6.52e-23 | 1.32e-25 | 29100 | 81.7 | 40300 | 78.8 | 1.04 |
223 | 28 | MA0911.1 | Hoxa11 | MA0911.1 | RGTCGTAAAANT | Hoxa11 | 9.02e-23 | 1.82e-25 | 24200 | 68.1 | 33100 | 64.7 | 1.05 |
224 | 3 | MA1644.1 | NFYC | MA1644.1 | RDCCAATCASN | NFYC | 1.06e-22 | 2.11e-25 | 28700 | 80.7 | 39800 | 77.8 | 1.04 |
225 | 39 | MA0877.3 | BARHL1 | MA0877.3 | AMCGTTTA | BARHL1 | 1.27e-22 | 2.54e-25 | 29100 | 81.9 | 40500 | 79.1 | 1.04 |
226 | 28 | MA0908.1 | HOXD11 | MA0908.1 | RTCGTAAAAH | HOXD11 | 2.19e-22 | 4.35e-25 | 14100 | 39.8 | 18600 | 36.3 | 1.1 |
227 | 3 | MA0132.2 | PDX1 | MA0132.2 | VYAATTAR | PDX1 | 5.78e-22 | 1.14e-24 | 14600 | 41.1 | 19300 | 37.6 | 1.09 |
228 | 11 | MA0787.1 | POU3F2 | MA0787.1 | WTATGCWAATKA | POU3F2 | 1.19e-21 | 2.35e-24 | 19000 | 53.3 | 25500 | 49.8 | 1.07 |
229 | 23 | MA0808.1 | TEAD3 | MA0808.1 | RCATTCCW | TEAD3 | 1.44e-21 | 2.82e-24 | 13600 | 38.1 | 17800 | 34.8 | 1.1 |
230 | 3 | MA1501.1 | HOXB7 | MA1501.1 | GGTAATTAVS | HOXB7 | 2.61e-21 | 5.09e-24 | 18500 | 51.9 | 24800 | 48.4 | 1.07 |
231 | 40 | MA0063.2 | NKX2-5 | MA0063.2 | NNCACTCAANN | NKX2-5 | 2.88e-21 | 5.59e-24 | 27300 | 76.6 | 37700 | 73.6 | 1.04 |
232 | 26 | MA1636.1 | CEBPG | MA1636.1 | NDATGATGCAATMHH | CEBPG | 2.91e-21 | 5.62e-24 | 17300 | 48.7 | 23100 | 45.2 | 1.08 |
233 | 41 | MA0092.1 | Hand1::Tcf3 | MA0092.1 | BRTCTGGMWT | Hand1::Tcf3 | 4.52e-21 | 8.7e-24 | 32600 | 91.7 | 45900 | 89.7 | 1.02 |
234 | 3 | MA1499.1 | HOXB4 | MA1499.1 | RTMATTAR | HOXB4 | 8.96e-21 | 1.72e-23 | 13700 | 38.5 | 18000 | 35.2 | 1.09 |
235 | 42 | MA1479.1 | DMRTC2 | MA1479.1 | AATTGMTACATT | DMRTC2 | 1.94e-20 | 3.7e-23 | 26800 | 75.3 | 37000 | 72.3 | 1.04 |
236 | 8 | MA0157.3 | Foxo3 | MA0157.3 | NDGTAAACARNN | Foxo3 | 3.49e-20 | 6.64e-23 | 33800 | 95 | 47800 | 93.4 | 1.02 |
237 | 9 | MA0719.1 | RHOXF1 | MA0719.1 | NTRATCCN | RHOXF1 | 4.59e-20 | 8.7e-23 | 22600 | 63.7 | 30900 | 60.4 | 1.05 |
238 | 13 | MA0878.3 | CDX1 | MA0878.3 | GGYMATAAAA | CDX1 | 5.95e-20 | 1.12e-22 | 13300 | 37.4 | 17500 | 34.2 | 1.09 |
239 | 3 | MA1960.1 | MGA::EVX1 | MA1960.1 | RGGTGWTAATKD | MGA::EVX1 | 8.66e-20 | 1.63e-22 | 19300 | 54.4 | 26100 | 51 | 1.07 |
240 | 23 | MA0090.3 | TEAD1 | MA0090.3 | NNACATTCCAGSN | TEAD1 | 1.24e-19 | 2.32e-22 | 24400 | 68.6 | 33500 | 65.4 | 1.05 |
241 | 43 | MA1624.1 | Stat5a | MA1624.1 | NTTCCAAGAANH | Stat5a | 1.61e-19 | 3e-22 | 29700 | 83.6 | 41500 | 81 | 1.03 |
242 | 3 | MA0601.1 | Arid3b | MA0601.1 | ATATTAATWAN | Arid3b | 2.57e-19 | 4.77e-22 | 14800 | 41.6 | 19600 | 38.3 | 1.08 |
243 | 44 | MA0626.1 | Npas2 | MA0626.1 | NSCACGTGTN | Npas2 | 2.82e-19 | 5.22e-22 | 16900 | 47.6 | 22700 | 44.3 | 1.07 |
244 | 18 | MA0842.2 | NRL | MA0842.2 | AAWWNTGCTGACG | NRL | 2.84e-19 | 5.22e-22 | 9290 | 26.1 | 11900 | 23.3 | 1.12 |
245 | 3 | MA0715.1 | PROP1 | MA0715.1 | TAATYWAATTA | PROP1 | 4.39e-19 | 8.04e-22 | 13400 | 37.6 | 17600 | 34.4 | 1.09 |
246 | 21 | MA0497.1 | MEF2C | MA0497.1 | DDDCYAAAAATAGMW | MEF2C | 7.19e-19 | 1.31e-21 | 13500 | 38 | 17800 | 34.8 | 1.09 |
247 | 35 | MA0782.2 | PKNOX1 | MA0782.2 | NNTGAGTGACAGVNN | PKNOX1 | 1.28e-18 | 2.32e-21 | 12500 | 35.1 | 16400 | 32 | 1.1 |
248 | 43 | MA1625.1 | Stat5b | MA1625.1 | NNNTTCCCAGAANNN | Stat5b | 5.15e-18 | 9.31e-21 | 29800 | 83.8 | 41700 | 81.4 | 1.03 |
249 | 8 | MA0480.2 | Foxo1 | MA0480.2 | NDGTAAACANN | Foxo1 | 6.54e-18 | 1.18e-20 | 33800 | 94.9 | 47800 | 93.4 | 1.02 |
250 | 45 | MA0483.1 | Gfi1B | MA0483.1 | AAATCWCWGCH | Gfi1B | 7.59e-18 | 1.36e-20 | 13100 | 36.9 | 17300 | 33.8 | 1.09 |
251 | 26 | MA1131.1 | FOSL2::JUN | MA1131.1 | GRTGACGTMAT | FOSL2::JUN | 9.85e-18 | 1.76e-20 | 25200 | 70.9 | 34800 | 67.9 | 1.04 |
252 | 26 | MA0025.2 | NFIL3 | MA0025.2 | NRTTATGYAATNN | NFIL3 | 1.47e-17 | 2.63e-20 | 13800 | 38.8 | 18300 | 35.8 | 1.09 |
253 | 46 | MA1588.1 | ZNF136 | MA1588.1 | RWATTCTTGGTTGRC | ZNF136 | 1.76e-17 | 3.12e-20 | 16600 | 46.7 | 22300 | 43.5 | 1.07 |
254 | 3 | MA0618.1 | LBX1 | MA0618.1 | TTAATTAG | LBX1 | 2.73e-17 | 4.83e-20 | 16900 | 47.6 | 22700 | 44.4 | 1.07 |
255 | 37 | MA0152.2 | Nfatc2 | MA0152.2 | NNRNAATGGAAANWNH | Nfatc2 | 7.32e-17 | 1.29e-19 | 33100 | 92.9 | 46700 | 91.2 | 1.02 |
256 | 47 | MA1975.1 | ZNF214 | MA1975.1 | VWTCATCAABGTCCT | ZNF214 | 1.11e-16 | 1.95e-19 | 28400 | 80 | 39600 | 77.4 | 1.03 |
257 | 25 | MA0754.2 | CUX1 | MA0754.2 | TAATCGATAM | CUX1 | 1.19e-16 | 2.08e-19 | 17900 | 50.4 | 24200 | 47.3 | 1.07 |
258 | 9 | MA1547.2 | PITX2 | MA1547.2 | YTAATCCY | PITX2 | 1.32e-16 | 2.29e-19 | 12000 | 33.8 | 15800 | 30.9 | 1.09 |
259 | 8 | MA0031.1 | FOXD1 | MA0031.1 | GTAAACAW | FOXD1 | 1.49e-16 | 2.58e-19 | 15900 | 44.8 | 21300 | 41.7 | 1.07 |
260 | 8 | MA0851.1 | Foxj3 | MA0851.1 | NDAADGTAAACAAANNM | Foxj3 | 2.15e-16 | 3.71e-19 | 18200 | 51.2 | 24600 | 48.1 | 1.06 |
261 | 48 | MA1118.1 | SIX1 | MA1118.1 | GWAACCTGAKM | SIX1 | 2.27e-16 | 3.91e-19 | 29500 | 83 | 41300 | 80.6 | 1.03 |
262 | 31 | MA1709.1 | ZIM3 | MA1709.1 | RVMAACAGAAACCYM | ZIM3 | 6.37e-16 | 1.09e-18 | 23800 | 67 | 32800 | 64.1 | 1.05 |
263 | 26 | MA1127.1 | FOSB::JUN | MA1127.1 | GATGACGTCAT | FOSB::JUN | 9.29e-16 | 1.59e-18 | 27300 | 76.6 | 37900 | 74 | 1.04 |
264 | 17 | MA0095.3 | Yy1 | MA0095.3 | NNCAAAATGGCN | Yy1 | 1.55e-15 | 2.63e-18 | 33400 | 93.8 | 47200 | 92.2 | 1.02 |
265 | 49 | MA1729.1 | ZNF680 | MA1729.1 | GNCCAAGAAGAATDA | ZNF680 | 1.61e-15 | 2.72e-18 | 22500 | 63.4 | 30900 | 60.5 | 1.05 |
266 | 50 | MA1974.1 | ZNF211 | MA1974.1 | HWYATATACCAYRYW | ZNF211 | 1.62e-15 | 2.73e-18 | 15100 | 42.5 | 20200 | 39.6 | 1.07 |
267 | 40 | MA1523.1 | MSANTD3 | MA1523.1 | STVCACTCAC | MSANTD3 | 1.71e-15 | 2.87e-18 | 30300 | 85.1 | 42400 | 82.9 | 1.03 |
268 | 43 | MA0144.2 | STAT3 | MA0144.2 | YTTCYKGGAAD | STAT3 | 1.99e-15 | 3.34e-18 | 20000 | 56.1 | 27200 | 53.2 | 1.06 |
269 | 26 | MA0833.2 | ATF4 | MA0833.2 | DDATGATGCAATMN | ATF4 | 2.49e-15 | 4.15e-18 | 16400 | 46.2 | 22100 | 43.3 | 1.07 |
270 | 51 | MA1150.1 | RORB | MA1150.1 | AWWTRGGTCAH | RORB | 3.31e-15 | 5.5e-18 | 32400 | 91 | 45700 | 89.2 | 1.02 |
271 | 27 | MA0102.4 | CEBPA | MA0102.4 | NDATTGCACAATHN | CEBPA | 3.39e-15 | 5.61e-18 | 16800 | 47.2 | 22600 | 44.2 | 1.07 |
272 | 42 | MA1707.1 | DMRTA1 | MA1707.1 | AWTTGWTACAWT | DMRTA1 | 4.1e-15 | 6.77e-18 | 25500 | 71.7 | 35300 | 69 | 1.04 |
273 | 40 | MA1645.1 | NKX2-2 | MA1645.1 | NNNCCACTCAANNN | NKX2-2 | 5.53e-15 | 9.08e-18 | 25900 | 72.8 | 35900 | 70.1 | 1.04 |
274 | 52 | MA1593.1 | ZNF317 | MA1593.1 | WVACAGCAGAYW | ZNF317 | 7.38e-15 | 1.21e-17 | 33600 | 94.5 | 47600 | 93.1 | 1.02 |
275 | 53 | MA1124.1 | ZNF24 | MA1124.1 | CATTCATTCATTC | ZNF24 | 7.94e-15 | 1.3e-17 | 8450 | 23.7 | 10900 | 21.3 | 1.12 |
276 | 3 | MA0874.1 | Arx | MA0874.1 | VTBCRYTAATTARTKSW | Arx | 1.37e-14 | 2.23e-17 | 12200 | 34.2 | 16100 | 31.4 | 1.09 |
277 | 40 | MA1994.1 | Nkx2-1 | MA1994.1 | NVCACTTGARHNN | Nkx2-1 | 1.94e-14 | 3.14e-17 | 26000 | 73.1 | 36000 | 70.4 | 1.04 |
278 | 17 | MA0442.2 | SOX10 | MA0442.2 | NDAACAAAGVN | SOX10 | 2.08e-14 | 3.36e-17 | 26000 | 73 | 36000 | 70.4 | 1.04 |
279 | 23 | MA1121.1 | TEAD2 | MA1121.1 | NYACATTCCWNNS | TEAD2 | 5.71e-14 | 9.18e-17 | 24600 | 69.2 | 34000 | 66.5 | 1.04 |
280 | 35 | MA0775.1 | MEIS3 | MA0775.1 | DTGACAGS | MEIS3 | 6.18e-14 | 9.9e-17 | 6820 | 19.2 | 8690 | 17 | 1.13 |
281 | 22 | MA1991.1 | Hnf1A | MA1991.1 | NBCCTTTGATSTBN | Hnf1A | 8.18e-14 | 1.31e-16 | 17400 | 49 | 23600 | 46.2 | 1.06 |
282 | 54 | MA0140.2 | GATA1::TAL1 | MA0140.2 | MTTATCWSNNNNNHVCAG | GATA1::TAL1 | 1.27e-13 | 2.01e-16 | 6640 | 18.6 | 8450 | 16.5 | 1.13 |
283 | 26 | MA1632.1 | ATF2 | MA1632.1 | NNATGABGTCATN | ATF2 | 1.52e-13 | 2.41e-16 | 15300 | 42.9 | 20500 | 40.1 | 1.07 |
284 | 13 | MA1125.1 | ZNF384 | MA1125.1 | DNWMAAAAAAAA | ZNF384 | 1.79e-13 | 2.83e-16 | 16800 | 47.2 | 22700 | 44.4 | 1.06 |
285 | 26 | MA0093.3 | USF1 | MA0093.3 | NDGTCATGTGACHN | USF1 | 1.85e-13 | 2.91e-16 | 21500 | 60.3 | 29400 | 57.5 | 1.05 |
286 | 37 | MA0624.2 | Nfatc1 | MA0624.2 | NVATGGAAAAWN | Nfatc1 | 2.32e-13 | 3.64e-16 | 32000 | 90 | 45100 | 88.2 | 1.02 |
287 | 26 | MA0498.2 | MEIS1 | MA0498.2 | HTGACAD | MEIS1 | 5.55e-13 | 8.68e-16 | 20800 | 58.4 | 28500 | 55.6 | 1.05 |
288 | 48 | MA1119.1 | SIX2 | MA1119.1 | NNCTGAAACCTGATMY | SIX2 | 9.24e-13 | 1.44e-15 | 25000 | 70.3 | 34700 | 67.8 | 1.04 |
289 | 3 | MA1577.1 | TLX2 | MA1577.1 | BYAATTAR | TLX2 | 1.29e-12 | 2e-15 | 9040 | 25.4 | 11800 | 23.1 | 1.1 |
290 | 28 | MA0906.1 | HOXC12 | MA0906.1 | RGTCGTAAAAH | HOXC12 | 1.71e-12 | 2.65e-15 | 12800 | 36 | 17100 | 33.4 | 1.08 |
291 | 31 | MA1508.1 | IKZF1 | MA1508.1 | VVAACAGGAARN | IKZF1 | 2.23e-12 | 3.43e-15 | 35300 | 99.2 | 50500 | 98.7 | 1.01 |
292 | 55 | MA0847.3 | FOXD2 | MA0847.3 | SYTAARYAAACA | FOXD2 | 3.4e-12 | 5.22e-15 | 19300 | 54.4 | 26400 | 51.7 | 1.05 |
293 | 3 | MA1507.1 | HOXD4 | MA1507.1 | VTMATTAD | HOXD4 | 3.7e-12 | 5.66e-15 | 7900 | 22.2 | 10300 | 20 | 1.11 |
294 | 28 | MA0907.1 | HOXC13 | MA0907.1 | BCTCGTAAAAH | HOXC13 | 4.88e-12 | 7.45e-15 | 15600 | 43.8 | 21100 | 41.2 | 1.06 |
295 | 26 | MA0829.2 | SREBF1 | MA0829.2 | DNATCACSTGATNH | SREBF1 | 6.15e-12 | 9.35e-15 | 25600 | 72.1 | 35600 | 69.7 | 1.03 |
296 | 31 | MA1953.1 | FOXO1::ELF1 | MA1953.1 | RTMAACAGGAAGTN | FOXO1::ELF1 | 1.78e-11 | 2.7e-14 | 26000 | 73.1 | 36200 | 70.8 | 1.03 |
297 | 8 | MA0613.1 | FOXG1 | MA0613.1 | RTAAACAW | FOXG1 | 1.82e-11 | 2.75e-14 | 15400 | 43.4 | 20900 | 40.8 | 1.06 |
298 | 3 | MA0060.3 | NFYA | MA0060.3 | RRCCAATCAGM | NFYA | 1.94e-11 | 2.92e-14 | 13000 | 36.4 | 17400 | 33.9 | 1.07 |
299 | 56 | MA0101.1 | REL | MA0101.1 | BGGRNWTTCC | REL | 2.87e-11 | 4.31e-14 | 32400 | 91.1 | 45800 | 89.5 | 1.02 |
300 | 30 | MA1487.2 | FOXE1 | MA1487.2 | VYTAAAYAAACAAH | FOXE1 | 3.26e-11 | 4.88e-14 | 16800 | 47.2 | 22800 | 44.6 | 1.06 |
301 | 26 | MA1139.1 | FOSL2::JUNB | MA1139.1 | DATGACGTCATH | FOSL2::JUNB | 3.8e-11 | 5.67e-14 | 19200 | 54 | 26300 | 51.4 | 1.05 |
302 | 45 | MA0038.2 | GFI1 | MA0038.2 | BMAATCACDGCA | GFI1 | 3.88e-11 | 5.76e-14 | 7930 | 22.3 | 10300 | 20.2 | 1.1 |
303 | 34 | MA0884.2 | DUXA | MA0884.2 | YTRATTWAATYAR | DUXA | 5.97e-11 | 8.84e-14 | 5480 | 15.4 | 6960 | 13.6 | 1.13 |
304 | 22 | MA0869.2 | Sox11 | MA0869.2 | NRGAACAAAGVV | Sox11 | 6.61e-11 | 9.76e-14 | 33900 | 95.4 | 48200 | 94.3 | 1.01 |
305 | 57 | MA1524.2 | Msgn1 | MA1524.2 | VRRRACAAATGGTNNN | Msgn1 | 8.55e-11 | 1.26e-13 | 21800 | 61.3 | 30100 | 58.8 | 1.04 |
306 | 57 | MA0461.2 | Atoh1 | MA0461.2 | RMCATATGBY | Atoh1 | 1.15e-10 | 1.69e-13 | 23300 | 65.4 | 32200 | 63 | 1.04 |
307 | 3 | MA0702.2 | LMX1A | MA0702.2 | TTAATTAM | LMX1A | 2.54e-10 | 3.71e-13 | 9310 | 26.2 | 12300 | 24 | 1.09 |
308 | 26 | MA0043.3 | HLF | MA0043.3 | NNRTTATGCAACHN | HLF | 3.39e-10 | 4.94e-13 | 10600 | 29.8 | 14100 | 27.5 | 1.08 |
309 | 26 | MA0693.3 | Vdr | MA0693.3 | NNGAGTTCANN | Vdr | 3.48e-10 | 5.05e-13 | 29300 | 82.3 | 41200 | 80.4 | 1.02 |
310 | 37 | MA0625.2 | NFATC3 | MA0625.2 | RYGGAAAMH | NFATC3 | 3.83e-10 | 5.54e-13 | 35300 | 99.3 | 50600 | 98.9 | 1 |
311 | 26 | MA1126.1 | FOS::JUN | MA1126.1 | DRTGACGTCATHNDTN | FOS::JUN | 1.04e-09 | 1.51e-12 | 28500 | 80.2 | 40100 | 78.3 | 1.03 |
312 | 28 | MA1503.1 | HOXB9 | MA1503.1 | GTCGTAAAAH | HOXB9 | 1.09e-09 | 1.57e-12 | 6180 | 17.4 | 7980 | 15.6 | 1.12 |
313 | 58 | MA1105.2 | GRHL2 | MA1105.2 | NRAACAGGTTYN | GRHL2 | 1.45e-09 | 2.08e-12 | 23600 | 66.2 | 32700 | 63.9 | 1.04 |
314 | 59 | MA0479.1 | FOXH1 | MA0479.1 | BNSAATCCACA | FOXH1 | 2.12e-09 | 3.03e-12 | 6870 | 19.3 | 8940 | 17.5 | 1.11 |
315 | 35 | MA1114.1 | PBX3 | MA1114.1 | NNNTGAGTGACAGGBNN | PBX3 | 2.49e-09 | 3.55e-12 | 13500 | 38.1 | 18300 | 35.8 | 1.06 |
316 | 21 | MA0773.1 | MEF2D | MA0773.1 | DCTAWAAATAGM | MEF2D | 3.97e-09 | 5.63e-12 | 4680 | 13.2 | 5940 | 11.6 | 1.13 |
317 | 26 | MA0843.1 | TEF | MA0843.1 | BRTTACRTAAYM | TEF | 5.46e-09 | 7.72e-12 | 9860 | 27.7 | 13100 | 25.6 | 1.08 |
318 | 35 | MA0797.1 | TGIF2 | MA0797.1 | TGACAGSTGTCA | TGIF2 | 6.08e-09 | 8.58e-12 | 3900 | 11 | 4890 | 9.56 | 1.15 |
319 | 42 | MA1478.1 | DMRTA2 | MA1478.1 | AATTGHTACAWT | DMRTA2 | 6.1e-09 | 8.58e-12 | 25200 | 70.7 | 35100 | 68.6 | 1.03 |
320 | 51 | MA1151.1 | RORC | MA1151.1 | NWAWNTRGGTCA | RORC | 6.89e-09 | 9.67e-12 | 34600 | 97.2 | 49300 | 96.3 | 1.01 |
321 | 40 | MA0672.1 | NKX2-3 | MA0672.1 | VCCACTTRAV | NKX2-3 | 8.28e-09 | 1.16e-11 | 22400 | 63 | 31100 | 60.7 | 1.04 |
322 | 43 | MA0519.1 | Stat5a::Stat5b | MA0519.1 | DTTTCCMAGAA | Stat5a::Stat5b | 8.83e-09 | 1.23e-11 | 18000 | 50.5 | 24700 | 48.2 | 1.05 |
323 | 26 | MA0488.1 | JUN | MA0488.1 | DDRATGATGTCAT | JUN | 1.17e-08 | 1.63e-11 | 5350 | 15 | 6870 | 13.4 | 1.12 |
324 | 35 | MA1572.1 | TGIF2LY | MA1572.1 | TGACAGCTGTCA | TGIF2LY | 1.42e-08 | 1.96e-11 | 21100 | 59.3 | 29200 | 57.1 | 1.04 |
325 | 44 | MA0620.3 | MITF | MA0620.3 | NNNDRTCACGTGAYHNNN | MITF | 2.18e-08 | 3.01e-11 | 22100 | 62 | 30600 | 59.8 | 1.04 |
326 | 43 | MA0518.1 | Stat4 | MA0518.1 | YTTCYRGGAARNVR | Stat4 | 2.54e-08 | 3.49e-11 | 18400 | 51.7 | 25300 | 49.4 | 1.05 |
327 | 3 | MA0621.1 | mix-a | MA0621.1 | NDTTAATTAGN | mix-a | 2.98e-08 | 4.08e-11 | 7100 | 20 | 9310 | 18.2 | 1.1 |
328 | 21 | MA0660.1 | MEF2B | MA0660.1 | RCTAWAAATAGC | MEF2B | 4.11e-08 | 5.62e-11 | 4730 | 13.3 | 6050 | 11.8 | 1.13 |
329 | 51 | MA1111.1 | NR2F2 | MA1111.1 | NAAAGGTCANR | NR2F2 | 6.53e-08 | 8.9e-11 | 24300 | 68.3 | 33900 | 66.2 | 1.03 |
330 | 60 | MA0595.1 | SREBF1 | MA0595.1 | VTCACCCCAY | SREBF1 | 1.03e-07 | 1.4e-10 | 7590 | 21.3 | 10000 | 19.6 | 1.09 |
331 | 26 | MA1136.1 | FOSB::JUNB | MA1136.1 | RTGACGTCAT | FOSB::JUNB | 1.11e-07 | 1.51e-10 | 28200 | 79.1 | 39600 | 77.3 | 1.02 |
332 | 26 | MA0631.1 | Six3 | MA0631.1 | DATRGGGTATCAYNWNT | Six3 | 3.49e-07 | 4.71e-10 | 14300 | 40.1 | 19500 | 38.1 | 1.05 |
333 | 61 | MA1116.1 | RBPJ | MA1116.1 | BVTGGGAANN | RBPJ | 3.91e-07 | 5.26e-10 | 30500 | 85.8 | 43100 | 84.3 | 1.02 |
334 | 30 | MA0040.1 | Foxq1 | MA0040.1 | HATTGTTTATW | Foxq1 | 5.43e-07 | 7.29e-10 | 10900 | 30.6 | 14700 | 28.7 | 1.07 |
335 | 8 | MA0849.1 | FOXO6 | MA0849.1 | GTAAACA | FOXO6 | 8.96e-07 | 1.2e-09 | 20100 | 56.4 | 27800 | 54.4 | 1.04 |
336 | 11 | MA1962.1 | POU2F1::SOX2 | MA1962.1 | MATTTRCATMACAATRG | POU2F1::SOX2 | 9.1e-07 | 1.22e-09 | 7300 | 20.5 | 9650 | 18.9 | 1.09 |
337 | 62 | MA0525.2 | TP63 | MA0525.2 | RACATGTYGKGACATGTC | TP63 | 9.89e-07 | 1.32e-09 | 285 | 0.8 | 243 | 0.47 | 1.69 |
338 | 3 | MA0854.1 | Alx1 | MA0854.1 | CGMRYTAATTARTNMNY | Alx1 | 1.05e-06 | 1.39e-09 | 8210 | 23.1 | 10900 | 21.4 | 1.08 |
339 | 34 | MA1720.1 | ZNF85 | MA1720.1 | DHDGAGATTACWKCAK | ZNF85 | 1.32e-06 | 1.75e-09 | 17500 | 49.2 | 24100 | 47.1 | 1.04 |
340 | 28 | MA0873.1 | HOXD12 | MA0873.1 | RGTCGTAAAAH | HOXD12 | 1.41e-06 | 1.86e-09 | 9680 | 27.2 | 13000 | 25.4 | 1.07 |
341 | 60 | MA0596.1 | SREBF2 | MA0596.1 | RTGGGGTGAY | SREBF2 | 1.45e-06 | 1.9e-09 | 2060 | 5.8 | 2500 | 4.89 | 1.19 |
342 | 18 | MA1520.1 | MAF | MA1520.1 | NTGCTGASTCAGCAH | MAF | 1.54e-06 | 2.02e-09 | 1620 | 4.55 | 1920 | 3.74 | 1.22 |
343 | 63 | MA0689.1 | TBX20 | MA0689.1 | WAGGTGTGAAR | TBX20 | 2.9e-06 | 3.79e-09 | 16100 | 45.2 | 22100 | 43.2 | 1.05 |
344 | 63 | MA0800.1 | EOMES | MA0800.1 | RAGGTGTGAAAWN | EOMES | 3.37e-06 | 4.39e-09 | 29900 | 83.9 | 42200 | 82.4 | 1.02 |
345 | 64 | MA0682.2 (PITX1) | MEME-3 | CTGGGATTACAGGCR | CTGGGATTACAGGCR | PITX1 | 4.4e-06 | 5.73e-09 | 1100 | 3.1 | 1260 | 2.45 | 1.26 |
346 | 51 | MA0494.1 | Nr1h3::Rxra | MA0494.1 | TGACCTNNAGTRACCYYDN | Nr1h3::Rxra | 5.41e-06 | 7e-09 | 24400 | 68.6 | 34200 | 66.8 | 1.03 |
347 | 40 | MA0673.1 | NKX2-8 | MA0673.1 | VCACTTSAV | NKX2-8 | 5.42e-06 | 7e-09 | 30700 | 86.4 | 43500 | 85 | 1.02 |
348 | 65 | MA0062.3 | GABPA | MA0062.3 | NNCACTTCCTGTNN | GABPA | 6.06e-06 | 7.82e-09 | 35300 | 99.1 | 50500 | 98.7 | 1 |
349 | 26 | MA0492.1 | JUND | MA0492.1 | DDDRATGABGTCATN | JUND | 7.53e-06 | 9.68e-09 | 5870 | 16.5 | 7720 | 15.1 | 1.09 |
350 | 17 | MA0867.2 | SOX4 | MA0867.2 | RAACAAAGRV | SOX4 | 1.31e-05 | 1.67e-08 | 11700 | 32.9 | 15900 | 31.1 | 1.06 |
351 | 23 | MA0484.2 | HNF4G | MA0484.2 | DNCAAAGTCCANN | HNF4G | 1.31e-05 | 1.67e-08 | 11200 | 31.6 | 15300 | 29.8 | 1.06 |
352 | 26 | MA1129.1 | FOSL1::JUN | MA1129.1 | ATGACGTCAT | FOSL1::JUN | 1.78e-05 | 2.27e-08 | 31300 | 88 | 44400 | 86.7 | 1.01 |
353 | 62 | MA0106.3 | TP53 | MA0106.3 | RACATGYCCGGRCATGTY | TP53 | 1.89e-05 | 2.41e-08 | 127 | 0.36 | 85 | 0.17 | 2.14 |
354 | 17 | MA0087.2 | Sox5 | MA0087.2 | VRVAACAATGGNN | Sox5 | 1.93e-05 | 2.45e-08 | 15900 | 44.8 | 22000 | 42.9 | 1.04 |
355 | 44 | MA1997.1 | Olig2 | MA1997.1 | NVCAGCTGBN | Olig2 | 2.85e-05 | 3.6e-08 | 5090 | 14.3 | 6670 | 13 | 1.1 |
356 | 44 | MA0743.2 | SCRT1 | MA0743.2 | NDWKCAACAGGTGKNN | SCRT1 | 2.87e-05 | 3.61e-08 | 29100 | 81.7 | 41000 | 80.2 | 1.02 |
357 | 57 | MA0827.1 | OLIG3 | MA0827.1 | AMCATATGBY | OLIG3 | 3.12e-05 | 3.92e-08 | 27100 | 76.1 | 38100 | 74.5 | 1.02 |
358 | 26 | MA0639.1 | DBP | MA0639.1 | NRTTACGTAAYV | DBP | 3.13e-05 | 3.92e-08 | 7090 | 19.9 | 9450 | 18.5 | 1.08 |
359 | 29 | MA0502.2 | NFYB | MA0502.2 | CYCATTGGCCVV | NFYB | 3.24e-05 | 4.04e-08 | 8530 | 24 | 11500 | 22.4 | 1.07 |
360 | 63 | MA0802.1 | TBR1 | MA0802.1 | AGGTGTGAAA | TBR1 | 3.43e-05 | 4.28e-08 | 28100 | 78.9 | 39600 | 77.3 | 1.02 |
361 | 41 | MA1643.1 | NFIB | MA1643.1 | NNCCTGGCAYNGTGCCAAGNN | NFIB | 3.59e-05 | 4.46e-08 | 2290 | 6.44 | 2850 | 5.56 | 1.16 |
362 | 27 | MA0838.1 | CEBPG | MA0838.1 | ATTRCGCAAY | CEBPG | 3.64e-05 | 4.51e-08 | 2060 | 5.79 | 2540 | 4.96 | 1.17 |
363 | 44 | MA1635.1 | BHLHE22 | MA1635.1 | NVCAGCTGBN | BHLHE22 | 4.38e-05 | 5.41e-08 | 5180 | 14.6 | 6800 | 13.3 | 1.1 |
364 | 66 | MA0069.1 | PAX6 | MA0069.1 | TTCACGCWTSANTK | PAX6 | 5.96e-05 | 7.34e-08 | 17300 | 48.7 | 24000 | 46.9 | 1.04 |
365 | 67 | MA0080.6 (Spi1) | MEME-5 | CWBYCTYCCTCTSTC | CWBYCTYCCTCTSTC | Spi1 | 8.45e-05 | 1.04e-07 | 14800 | 41.5 | 20300 | 39.7 | 1.04 |
366 | 57 | MA1618.1 | Ptf1a | MA1618.1 | NNACAGATGTTNN | Ptf1a | 9.55e-05 | 1.17e-07 | 26600 | 74.6 | 37400 | 73.1 | 1.02 |
367 | 23 | MA0114.4 | HNF4A | MA0114.4 | DNCAAAGTCCANN | HNF4A | 9.89e-05 | 1.21e-07 | 15800 | 44.5 | 21900 | 42.7 | 1.04 |
368 | 26 | MA0592.3 | ESRRA | MA0592.3 | NBTCAAGGTCAHN | ESRRA | 0.000128 | 1.55e-07 | 32300 | 90.7 | 45900 | 89.6 | 1.01 |
369 | 26 | MA1133.1 | JUN::JUNB | MA1133.1 | KRTGACGTCATN | JUN::JUNB | 0.000137 | 1.67e-07 | 24900 | 69.9 | 34900 | 68.2 | 1.02 |
370 | 34 | MA0468.1 | DUX4 | MA0468.1 | TAAYYYAATCA | DUX4 | 0.000208 | 2.52e-07 | 3530 | 9.93 | 4560 | 8.92 | 1.11 |
371 | 44 | MA0633.2 | Twist2 | MA0633.2 | NVCAGCTGBN | Twist2 | 0.000213 | 2.58e-07 | 4890 | 13.8 | 6440 | 12.6 | 1.09 |
372 | 12 | MA0683.1 | POU4F2 | MA0683.1 | RTGCATAATTAATGAG | POU4F2 | 0.00051 | 6.16e-07 | 1980 | 5.57 | 2470 | 4.83 | 1.15 |
373 | 26 | MA0629.1 | Rhox11 | MA0629.1 | ANKNCGCTGTWAWDVRW | Rhox11 | 0.000564 | 6.78e-07 | 18100 | 50.8 | 25100 | 49.1 | 1.03 |
374 | 26 | MA1996.1 | Nr1H2 | MA1996.1 | NNRAGGTCANN | Nr1H2 | 0.000566 | 6.79e-07 | 33200 | 93.4 | 47400 | 92.6 | 1.01 |
375 | 35 | MA1571.1 | TGIF2LX | MA1571.1 | TGACAGCTGTCA | TGIF2LX | 0.00057 | 6.82e-07 | 14600 | 41 | 20100 | 39.4 | 1.04 |
376 | 52 | MA0691.1 | TFAP4 | MA0691.1 | AWCAGCTGWT | TFAP4 | 0.000599 | 7.15e-07 | 1240 | 3.48 | 1480 | 2.89 | 1.2 |
377 | 26 | MA1145.1 | FOSL2::JUND | MA1145.1 | NNRTGACGTCAYHSN | FOSL2::JUND | 0.000748 | 8.9e-07 | 21700 | 61.1 | 30400 | 59.5 | 1.03 |
378 | 68 | MA0029.1 | Mecom | MA0029.1 | AAGAYAAGATAANA | Mecom | 0.000752 | 8.93e-07 | 3100 | 8.72 | 4000 | 7.81 | 1.12 |
379 | 69 | MA0687.1 | SPIC | MA0687.1 | HAAAAGVGGAAGTA | SPIC | 0.000899 | 1.06e-06 | 27400 | 76.9 | 38600 | 75.5 | 1.02 |
380 | 41 | MA0698.1 | ZBTB18 | MA0698.1 | NATCCAGATGTKB | ZBTB18 | 0.00129 | 1.53e-06 | 11800 | 33.2 | 16200 | 31.7 | 1.05 |
381 | 41 | MA1123.2 | TWIST1 | MA1123.2 | NNDCCAGATGTBN | TWIST1 | 0.00195 | 2.3e-06 | 20500 | 57.6 | 28700 | 56 | 1.03 |
382 | 70 | MA1989.1 | Bcl11B | MA1989.1 | NNAAACCACAARNN | Bcl11B | 0.00254 | 2.99e-06 | 30800 | 86.4 | 43700 | 85.4 | 1.01 |
383 | 71 | MA0160.2 | NR4A2 | MA0160.2 | BWAAAGGTCA | NR4A2 | 0.00327 | 3.83e-06 | 27100 | 76.2 | 38300 | 74.8 | 1.02 |
384 | 72 | MA0161.2 | NFIC | MA0161.2 | NNCTTGGCANN | NFIC | 0.00395 | 4.61e-06 | 33100 | 93 | 47200 | 92.2 | 1.01 |
385 | 8 | MA0614.1 | Foxj2 | MA0614.1 | RTAAACAA | Foxj2 | 0.00492 | 5.74e-06 | 3820 | 10.7 | 5020 | 9.82 | 1.09 |
386 | 18 | MA1521.1 | MAFA | MA1521.1 | NTGCTGASTCAGCAN | MAFA | 0.00615 | 7.14e-06 | 915 | 2.57 | 1080 | 2.12 | 1.21 |
387 | 73 | MA0119.1 | NFIC::TLX1 | MA0119.1 | TGGCASSRWGCCAA | NFIC::TLX1 | 0.00662 | 7.68e-06 | 1710 | 4.8 | 2140 | 4.18 | 1.15 |
388 | 74 | MA1977.1 | ZNF324 | MA1977.1 | NNNWRGMAGHYAAGGATGGTT | ZNF324 | 0.00668 | 7.72e-06 | 13800 | 38.8 | 19100 | 37.4 | 1.04 |
389 | 51 | MA0071.1 | RORA | MA0071.1 | AWMWAGGTCA | RORA | 0.0119 | 1.37e-05 | 9730 | 27.4 | 13300 | 26.1 | 1.05 |
390 | 33 | MA1561.1 | SOX12 | MA1561.1 | ACCGAACAATV | SOX12 | 0.0122 | 1.41e-05 | 19500 | 54.7 | 27300 | 53.3 | 1.03 |
391 | 17 | MA0515.1 | Sox6 | MA0515.1 | CCWTTGTYYY | Sox6 | 0.0124 | 1.42e-05 | 3140 | 8.83 | 4110 | 8.02 | 1.1 |
392 | 63 | MA0688.1 | TBX2 | MA0688.1 | DAGGTGTGAAA | TBX2 | 0.013 | 1.49e-05 | 30900 | 86.8 | 43900 | 85.8 | 1.01 |
393 | 75 | MA0508.3 | PRDM1 | MA0508.3 | YNCTTTCTCTH | PRDM1 | 0.015 | 1.71e-05 | 13500 | 38 | 18800 | 36.6 | 1.04 |
394 | 76 | MA0520.1 | Stat6 | MA0520.1 | BDBTTCCWSAGAAVY | Stat6 | 0.0187 | 2.13e-05 | 10100 | 28.4 | 13900 | 27.1 | 1.05 |
395 | 77 | MA0780.1 | PAX3 | MA0780.1 | TAATYRATTA | PAX3 | 0.0208 | 2.36e-05 | 5620 | 15.8 | 7560 | 14.8 | 1.07 |
396 | 65 | MA0640.2 | ELF3 | MA0640.2 | NNCCACTTCCTGNT | ELF3 | 0.0217 | 2.46e-05 | 34100 | 95.9 | 48800 | 95.4 | 1.01 |
397 | 51 | MA1148.1 | PPARA::RXRA | MA1148.1 | AWNTRGGTYAAAGGTCAN | PPARA::RXRA | 0.0224 | 2.53e-05 | 24200 | 68 | 34100 | 66.7 | 1.02 |
398 | 73 | MA1528.1 | NFIX | MA1528.1 | STTGGCRNNGTGCCARB | NFIX | 0.0252 | 2.84e-05 | 2050 | 5.76 | 2620 | 5.13 | 1.12 |
399 | 28 | MA1506.1 | HOXD10 | MA1506.1 | RGTCGTAAAAH | HOXD10 | 0.031 | 3.49e-05 | 2970 | 8.35 | 3890 | 7.61 | 1.1 |
400 | 35 | MA0796.1 | TGIF1 | MA0796.1 | TGACAGCTGTCA | TGIF1 | 0.0346 | 3.88e-05 | 1070 | 3 | 1310 | 2.55 | 1.18 |
401 | 3 | MA0135.1 | Lhx3 | MA0135.1 | WAATTAATTAWYY | Lhx3 | 0.0355 | 3.97e-05 | 5570 | 15.7 | 7510 | 14.7 | 1.07 |
402 | 78 | MA0083.3 | SRF | MA0083.3 | TGMCCATATATGGKCA | SRF | 0.0361 | 4.03e-05 | 455 | 1.28 | 507 | 0.99 | 1.29 |
403 | 71 | MA1541.1 | NR6A1 | MA1541.1 | NYCAAGKTCAAGKTCAN | NR6A1 | 0.0365 | 4.06e-05 | 23300 | 65.4 | 32800 | 64.1 | 1.02 |
13 | 79 | MA0528.2 (ZNF263) | STREME-10 | 10-CCTCCTCCCACCCC | 10-CCTCCTCCCACCCC | ZNF263 | 0.0368 | 0.00283 | 73 | 0.21 | 64 | 0.13 | 1.64 |
14 | 80 | 8-TGCCTGCCAS | STREME-8 | 8-TGCCTGCCAS | TGCCTGCCAS | TGCCTGCCAS | 0.0485 | 0.00346 | 5690 | 16 | 7840 | 15.3 | 1.04 |
15 | 79 | MA1596.1 (ZNF460) | MEME-4 | GGRGGCKGRGGCDGG | GGRGGCKGRGGCDGG | ZNF460 | 5.57 | 0.371 | 96 | 0.27 | 131 | 0.26 | 1.06 |
DOX_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DOX_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DOX_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
dplyr::filter(EVALUE.y < 0.05) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
# slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*15.2), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./15.2,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DOX_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DOX_24_20kb_combo %>%
# dplyr::filter(ALT_ID=="MEME-5")
# dplyr::filter(CLUSTER=="64")
dplyr::filter(CONSENSUS.x=="CTGGGATTACAGGCR")
EPI_24_20kb_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_24_20kb_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
EPI_24_20kb_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_24_20kb_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
EPI_24_20kb_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_24_20kb_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
EPI_24_20kb_sea_all <- rbind(EPI_24_20kb_sea_disc, EPI_24_20kb_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
EPI_24_20kb_combo <- EPI_24_20kb_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., EPI_24_20kb_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
EPI_24_20kb_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in EPI_24h 20kb") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA1134.1 (FOS::JUNB) | STREME-1 | 1-RATGASTCATY | RATGASTCATY | FOS::JUNB | 0 | 0 | 5480 | 15 | 3320 | 6.6 | 2.28 |
2 | 1 | MA1138.1 | FOSL2::JUNB | MA1138.1 | KRTGASTCAT | FOSL2::JUNB | 0 | 0 | 6050 | 16.6 | 4020 | 7.98 | 2.08 |
3 | 1 | MA1144.1 | FOSL2::JUND | MA1144.1 | KATGACTCAT | FOSL2::JUND | 0 | 0 | 5460 | 15 | 3450 | 6.85 | 2.19 |
4 | 1 | MA1134.1 | FOS::JUNB | MA1134.1 | KATGASTCATHN | FOS::JUNB | 1.24998608397835e-320 | 1.06224113855868e-321 | 4810 | 13.2 | 2850 | 5.66 | 2.34 |
5 | 1 | MA1135.1 | FOSB::JUNB | MA1135.1 | KRTGASTCAT | FOSB::JUNB | 2.38001302914645e-319 | 2.20007432093107e-320 | 5540 | 15.2 | 3560 | 7.08 | 2.15 |
6 | 1 | MA0477.2 | FOSL1 | MA0477.2 | NNATGACTCATNN | FOSL1 | 9.10000244514804e-318 | 7.1199800222182e-319 | 4910 | 13.5 | 2950 | 5.87 | 2.3 |
7 | 1 | MA0491.2 | JUND | MA0491.2 | NNATGACTCATNN | JUND | 5.82000000000717e-313 | 3.90000000000735e-314 | 5580 | 15.3 | 3640 | 7.23 | 2.12 |
8 | 1 | MA1128.1 | FOSL1::JUN | MA1128.1 | NKATGACTCATNN | FOSL1::JUN | 4.80000000000004e-311 | 2.8200000000019e-312 | 4400 | 12.1 | 2510 | 4.98 | 2.43 |
9 | 1 | MA1130.1 | FOSL2::JUN | MA1130.1 | NNRTGAGTCAYN | FOSL2::JUN | 1.03e-309 | 5.36000000000007e-311 | 4430 | 12.2 | 2540 | 5.05 | 2.41 |
10 | 1 | MA0099.3 | FOS::JUN | MA0099.3 | ATGAGTCAYM | FOS::JUN | 2.61e-309 | 1.22000000000001e-310 | 4500 | 12.4 | 2610 | 5.18 | 2.38 |
11 | 1 | MA1141.1 | FOS::JUND | MA1141.1 | NKATGAGTCATNN | FOS::JUND | 1.05e-304 | 4.35e-306 | 4940 | 13.6 | 3040 | 6.05 | 2.24 |
12 | 1 | MA0489.2 | Jun | MA0489.2 | NVTGACTCATNN | Jun | 1.11e-304 | 4.35e-306 | 5450 | 15 | 3550 | 7.05 | 2.12 |
13 | 1 | MA0476.1 | FOS | MA0476.1 | DVTGASTCATB | FOS | 4.73e-304 | 1.71e-305 | 4350 | 12 | 2490 | 4.95 | 2.41 |
14 | 1 | MA1633.2 | BACH1 | MA1633.2 | ATGACTCAT | BACH1 | 3.87e-303 | 1.3e-304 | 4780 | 13.1 | 2910 | 5.77 | 2.28 |
15 | 1 | MA0490.2 | JUNB | MA0490.2 | NNATGACTCATNN | JUNB | 1.34e-301 | 4.18e-303 | 5420 | 14.9 | 3540 | 7.02 | 2.12 |
16 | 1 | MA0476.1 (FOS) | MEME-2 | ATGASTCASH | ATGASTCASH | FOS | 2.27e-295 | 6.66e-297 | 4640 | 12.8 | 2810 | 5.59 | 2.28 |
17 | 1 | MA1634.1 | BATF | MA1634.1 | DATGACTCATH | BATF | 1.68e-293 | 4.62e-295 | 5150 | 14.1 | 3310 | 6.57 | 2.15 |
18 | 1 | MA1137.1 | FOSL1::JUNB | MA1137.1 | NDRTGACTCATNN | FOSL1::JUNB | 1.48e-290 | 3.85e-292 | 5220 | 14.3 | 3400 | 6.75 | 2.12 |
19 | 1 | MA0462.2 | BATF::JUN | MA0462.2 | DATGACTCATH | BATF::JUN | 4.54e-289 | 1.12e-290 | 4890 | 13.4 | 3080 | 6.13 | 2.19 |
20 | 1 | MA1988.1 | Atf3 | MA1988.1 | NRTGACTCABN | Atf3 | 8.72999999999999e-288 | 2.05e-289 | 4530 | 12.4 | 2740 | 5.45 | 2.28 |
21 | 1 | MA0655.1 | JDP2 | MA0655.1 | ATGACTCAT | JDP2 | 2.06e-284 | 4.6e-286 | 6460 | 17.8 | 4710 | 9.35 | 1.9 |
22 | 1 | MA0478.1 | FOSL2 | MA0478.1 | KRRTGASTCAB | FOSL2 | 6.12e-280 | 1.31e-281 | 5160 | 14.2 | 3400 | 6.75 | 2.1 |
23 | 1 | MA1142.1 | FOSL1::JUND | MA1142.1 | NRTGACTCAT | FOSL1::JUND | 2.84e-273 | 5.79e-275 | 5480 | 15.1 | 3760 | 7.46 | 2.02 |
24 | 1 | MA1928.1 | BNC2 | MA1928.1 | NDRTGAGTCABN | BNC2 | 6.07e-270 | 1.19e-271 | 6050 | 16.6 | 4370 | 8.67 | 1.92 |
25 | 1 | MA0835.2 | BATF3 | MA0835.2 | DATGACTCATH | BATF3 | 2.35e-264 | 4.41e-266 | 4060 | 11.2 | 2420 | 4.8 | 2.33 |
26 | 1 | MA1132.1 | JUN::JUNB | MA1132.1 | KATGACKCAT | JUN::JUNB | 5.41e-262 | 9.76e-264 | 4870 | 13.4 | 3210 | 6.37 | 2.1 |
27 | 1 | MA1101.2 | BACH2 | MA1101.2 | DWANCATGASTCATSNTWH | BACH2 | 1.61e-260 | 2.8e-262 | 6040 | 16.6 | 4420 | 8.77 | 1.89 |
28 | 1 | MA0841.1 | NFE2 | MA0841.1 | VATGACTCATS | NFE2 | 1.87e-245 | 3.13e-247 | 5130 | 14.1 | 3560 | 7.08 | 1.99 |
29 | 1 | MA0089.2 | MAFG::NFE2L1 | MA0089.2 | NVNATGACTCAGCADW | MAFG::NFE2L1 | 5.17e-154 | 8.36e-156 | 6800 | 18.7 | 6090 | 12.1 | 1.54 |
30 | 1 | MA0496.3 | MAFK | MA0496.3 | NGMTGACTCAGCMNH | MAFK | 1.57e-137 | 2.45e-139 | 5360 | 14.7 | 4610 | 9.15 | 1.61 |
31 | 1 | MA0501.1 | MAF::NFE2 | MA0501.1 | ATGACTCAGCANWTY | MAF::NFE2 | 1.43e-126 | 2.17e-128 | 8160 | 22.4 | 8000 | 15.9 | 1.41 |
32 | 1 | MA0150.2 | Nfe2l2 | MA0150.2 | CASNATGACTCAGCA | Nfe2l2 | 9.41e-90 | 1.38e-91 | 8360 | 23 | 8740 | 17.4 | 1.32 |
33 | 2 | MA1991.1 (Hnf1A) | STREME-8 | 8-TCAAAG | TCAAAG | Hnf1A | 7.75e-78 | 1.1e-79 | 32700 | 89.7 | 43000 | 85.4 | 1.05 |
34 | 1 | MA1137.1 (FOSL1::JUNB) | STREME-2 | 2-ABTCAKM | ABTCAKM | FOSL1::JUNB | 7.34e-76 | 1.01e-77 | 24600 | 67.5 | 30900 | 61.4 | 1.1 |
35 | 1 | MA0659.3 | Mafg | MA0659.3 | NWGMTGACTCAGCAN | Mafg | 7.98e-76 | 1.07e-77 | 4670 | 12.8 | 4450 | 8.84 | 1.45 |
36 | 3 | MA0720.1 | Shox2 | MA0720.1 | YYAATTAR | Shox2 | 2.16e-74 | 2.82e-76 | 31100 | 85.5 | 40600 | 80.7 | 1.06 |
37 | 3 | MA0723.2 | VAX2 | MA0723.2 | BYAATTAV | VAX2 | 3.3e-73 | 4.18e-75 | 30900 | 85 | 40400 | 80.2 | 1.06 |
38 | 4 | MA0084.1 | SRY | MA0084.1 | NWWAACAAT | SRY | 2.58e-71 | 3.18e-73 | 32800 | 90 | 43200 | 85.9 | 1.05 |
39 | 3 | MA0027.2 | EN1 | MA0027.2 | SYAATTAV | EN1 | 4.92e-71 | 5.92e-73 | 31100 | 85.5 | 40700 | 80.8 | 1.06 |
40 | 5 | 9-ATNAGG | STREME-9 | 9-ATNAGG | ATNAGG | ATNAGG | 6.67e-71 | 7.82e-73 | 32100 | 88.1 | 42100 | 83.7 | 1.05 |
41 | 3 | MA0718.1 | RAX | MA0718.1 | RYYAATTARH | RAX | 2.42e-70 | 2.77e-72 | 29300 | 80.4 | 37900 | 75.3 | 1.07 |
42 | 3 | MA0725.1 | VSX1 | MA0725.1 | YTAATTAN | VSX1 | 2.84e-68 | 3.18e-70 | 32700 | 89.9 | 43200 | 85.9 | 1.05 |
43 | 1 | MA1639.1 | MEIS1 | MA1639.1 | NHCATMAATCAHN | MEIS1 | 3.66e-68 | 3.99e-70 | 25200 | 69.1 | 31900 | 63.3 | 1.09 |
44 | 6 | MA0676.1 | Nr2e1 | MA0676.1 | AAAAGTCAA | Nr2e1 | 4.08e-67 | 4.35e-69 | 24300 | 66.7 | 30700 | 60.9 | 1.1 |
45 | 7 | MA1963.1 | SATB1 | MA1963.1 | WWDCTAATAAHWW | SATB1 | 4.74e-67 | 4.94e-69 | 24300 | 66.7 | 30700 | 60.9 | 1.1 |
46 | 3 | MA0886.1 | EMX2 | MA0886.1 | VBTAATTARB | EMX2 | 8.66e-67 | 8.83e-69 | 23100 | 63.5 | 29000 | 57.5 | 1.1 |
47 | 3 | MA0902.2 | HOXB2 | MA0902.2 | SYAATTAS | HOXB2 | 2.65e-66 | 2.64e-68 | 22300 | 61.4 | 27900 | 55.4 | 1.11 |
48 | 3 | MA0726.1 | VSX2 | MA0726.1 | CTAATTAN | VSX2 | 2.13e-64 | 2.08e-66 | 32600 | 89.4 | 43000 | 85.5 | 1.05 |
49 | 3 | MA0704.1 | Lhx4 | MA0704.1 | YTAATTAR | Lhx4 | 4.31e-64 | 4.12e-66 | 31500 | 86.6 | 41400 | 82.3 | 1.05 |
50 | 1 | MA1113.2 | PBX2 | MA1113.2 | NHCATAAATCAHN | PBX2 | 6.29e-63 | 5.91e-65 | 28300 | 77.8 | 36600 | 72.8 | 1.07 |
51 | 3 | MA0710.1 | NOTO | MA0710.1 | NYTAATTAVN | NOTO | 4.18e-62 | 3.85e-64 | 25700 | 70.5 | 32700 | 65 | 1.08 |
52 | 3 | MA0644.2 | ESX1 | MA0644.2 | YYAATTAN | ESX1 | 1.11e-61 | 1e-63 | 24900 | 68.3 | 31600 | 62.8 | 1.09 |
53 | 8 | MA0142.1 | Pou5f1::Sox2 | MA0142.1 | CWTTGTYATGCAAAT | Pou5f1::Sox2 | 2.24e-61 | 1.98e-63 | 27700 | 76.1 | 35700 | 70.9 | 1.07 |
54 | 3 | MA0674.1 | NKX6-1 | MA0674.1 | NTMATTAA | NKX6-1 | 1.04e-60 | 8.99e-63 | 33000 | 90.5 | 43700 | 86.9 | 1.04 |
55 | 1 | MA1640.1 | MEIS2 | MA1640.1 | NNATGATTTATGDNH | MEIS2 | 2.65e-59 | 2.26e-61 | 29600 | 81.3 | 38600 | 76.6 | 1.06 |
56 | 3 | MA0634.1 | ALX3 | MA0634.1 | YYYAATTANV | ALX3 | 4.88e-59 | 4.09e-61 | 31600 | 86.9 | 41700 | 82.8 | 1.05 |
57 | 8 | MA0786.1 | POU3F1 | MA0786.1 | WTATGCWAATKW | POU3F1 | 5.71e-59 | 4.7e-61 | 28900 | 79.5 | 37600 | 74.7 | 1.06 |
58 | 3 | MA0680.2 | Pax7 | MA0680.2 | DWTAATCAATTAWNH | Pax7 | 1.68e-58 | 1.36e-60 | 24300 | 66.7 | 30800 | 61.3 | 1.09 |
59 | 9 | MA0650.3 | Hoxa13 | MA0650.3 | NNCCAATAAAAN | Hoxa13 | 1.82e-58 | 1.45e-60 | 29900 | 82.1 | 39000 | 77.5 | 1.06 |
60 | 10 | MA1104.2 | GATA6 | MA1104.2 | HWWTCTTATCTNH | GATA6 | 1.96e-58 | 1.53e-60 | 27800 | 76.4 | 36000 | 71.4 | 1.07 |
61 | 8 | MA0788.1 | POU3F3 | MA0788.1 | WWTATGCWAATTW | POU3F3 | 2.74e-58 | 2.11e-60 | 26600 | 73 | 34200 | 67.9 | 1.08 |
62 | 10 | MA0035.4 | GATA1 | MA0035.4 | NHCTAATCTDH | GATA1 | 4.3e-58 | 3.25e-60 | 28600 | 78.5 | 37100 | 73.7 | 1.07 |
63 | 8 | MA0785.1 | POU2F1 | MA0785.1 | AWTATGCWAATK | POU2F1 | 5.47e-58 | 4.08e-60 | 28600 | 78.7 | 37200 | 73.9 | 1.07 |
64 | 3 | MA0713.1 | PHOX2A | MA0713.1 | TAATYYAATTA | PHOX2A | 1.96e-57 | 1.43e-59 | 25400 | 69.7 | 32400 | 64.4 | 1.08 |
65 | 3 | MA0628.1 | POU6F1 | MA0628.1 | AYTAATTART | POU6F1 | 2.21e-57 | 1.59e-59 | 28900 | 79.3 | 37500 | 74.5 | 1.06 |
66 | 3 | MA0879.2 | DLX1 | MA0879.2 | VYAATTAN | DLX1 | 4.58e-57 | 3.25e-59 | 33600 | 92.4 | 44900 | 89.2 | 1.04 |
67 | 3 | MA0890.1 | GBX2 | MA0890.1 | RSYAATTARN | GBX2 | 4.65e-57 | 3.25e-59 | 27500 | 75.6 | 35600 | 70.6 | 1.07 |
68 | 3 | MA1476.2 | Dlx5 | MA1476.2 | NDGCAATTAGNN | Dlx5 | 7.13e-57 | 4.92e-59 | 31500 | 86.4 | 41500 | 82.4 | 1.05 |
69 | 3 | MA0904.2 | HOXB5 | MA0904.2 | STAATTAS | HOXB5 | 9.09e-57 | 6.18e-59 | 20800 | 57.1 | 26000 | 51.6 | 1.11 |
70 | 11 | MA0041.2 | FOXD3 | MA0041.2 | WAWGHAAATAAACARY | FOXD3 | 1.58e-56 | 1.06e-58 | 29200 | 80.1 | 38000 | 75.4 | 1.06 |
71 | 11 | MA0845.1 | FOXB1 | MA0845.1 | WATGTAAATAT | FOXB1 | 2.47e-56 | 1.63e-58 | 28000 | 76.8 | 36200 | 71.9 | 1.07 |
72 | 3 | MA0654.1 | ISX | MA0654.1 | YYAATTAR | ISX | 4.62e-56 | 3.01e-58 | 21500 | 59 | 26900 | 53.5 | 1.1 |
73 | 3 | MA0889.1 | GBX1 | MA0889.1 | VCYAATTASY | GBX1 | 1.19e-55 | 7.64e-58 | 26200 | 71.8 | 33600 | 66.7 | 1.08 |
74 | 3 | MA1481.1 | DRGX | MA1481.1 | YYAATTAN | DRGX | 1.44e-55 | 9.12e-58 | 29100 | 80 | 37900 | 75.3 | 1.06 |
75 | 3 | MA0675.1 | NKX6-2 | MA0675.1 | NYMATTAA | NKX6-2 | 2.2e-55 | 1.37e-57 | 33200 | 91.1 | 44100 | 87.7 | 1.04 |
76 | 3 | MA0882.1 | DLX6 | MA0882.1 | VYAATTAN | DLX6 | 2.21e-55 | 1.37e-57 | 33900 | 93 | 45300 | 89.9 | 1.03 |
77 | 3 | MA1502.1 | HOXB8 | MA1502.1 | GYMATTAM | HOXB8 | 8.27e-55 | 5.04e-57 | 19400 | 53.2 | 24000 | 47.7 | 1.12 |
78 | 12 | MA0052.4 | MEF2A | MA0052.4 | DDCTAAAAATAGMHH | MEF2A | 1.19e-54 | 7.17e-57 | 26100 | 71.6 | 33500 | 66.5 | 1.08 |
79 | 8 | MA0789.1 | POU3F4 | MA0789.1 | TATGCWAAT | POU3F4 | 2.47e-54 | 1.47e-56 | 24700 | 68 | 31600 | 62.8 | 1.08 |
80 | 3 | MA0125.1 | Nobox | MA0125.1 | TAATTRSY | Nobox | 3.65e-54 | 2.14e-56 | 30300 | 83.1 | 39700 | 78.8 | 1.05 |
81 | 13 | MA1480.1 | DPRX | MA1480.1 | AGATAATCCN | DPRX | 4.12e-54 | 2.39e-56 | 29300 | 80.4 | 38200 | 75.9 | 1.06 |
82 | 9 | MA0465.2 | CDX2 | MA0465.2 | NDGCAATAAAWN | CDX2 | 4.97e-54 | 2.85e-56 | 29300 | 80.4 | 38200 | 75.9 | 1.06 |
83 | 3 | MA0722.1 | VAX1 | MA0722.1 | YTAATTAH | VAX1 | 6.26e-54 | 3.54e-56 | 33100 | 91 | 44100 | 87.6 | 1.04 |
84 | 14 | MA0635.1 | BARHL2 | MA0635.1 | VHTAAAYGRT | BARHL2 | 8.09e-54 | 4.52e-56 | 31300 | 86 | 41300 | 82 | 1.05 |
85 | 9 | MA0901.2 | HOXB13 | MA0901.2 | WDCCAATAAAAWWW | HOXB13 | 8.43e-54 | 4.66e-56 | 25400 | 69.8 | 32500 | 64.6 | 1.08 |
86 | 9 | MA0899.1 | HOXA10 | MA0899.1 | DGYMATAAAAH | HOXA10 | 1.71e-53 | 9.32e-56 | 25300 | 69.4 | 32400 | 64.3 | 1.08 |
87 | 3 | MA0793.1 | POU6F2 | MA0793.1 | ASCTMATTAW | POU6F2 | 2.02e-53 | 1.08e-55 | 28000 | 76.8 | 36300 | 72.1 | 1.07 |
88 | 3 | MA0158.2 | HOXA5 | MA0158.2 | GYMATTAS | HOXA5 | 2.03e-53 | 1.08e-55 | 17200 | 47.1 | 21000 | 41.7 | 1.13 |
89 | 8 | MA0787.1 | POU3F2 | MA0787.1 | WTATGCWAATKA | POU3F2 | 2.61e-53 | 1.38e-55 | 25100 | 69 | 32200 | 63.9 | 1.08 |
90 | 3 | MA0721.1 | UNCX | MA0721.1 | YYAATTAV | UNCX | 5.57e-53 | 2.9e-55 | 32500 | 89.4 | 43200 | 85.8 | 1.04 |
91 | 3 | MA0876.1 | BSX | MA0876.1 | SYVATTAV | BSX | 1.57e-52 | 8.1e-55 | 30000 | 82.4 | 39300 | 78.2 | 1.05 |
92 | 3 | MA1495.1 | HOXA1 | MA1495.1 | STAATTAS | HOXA1 | 2.96e-52 | 1.51e-54 | 16400 | 45.2 | 20100 | 39.9 | 1.13 |
93 | 3 | MA1505.1 | HOXC8 | MA1505.1 | KTAATTAM | HOXC8 | 3.44e-52 | 1.74e-54 | 32300 | 88.8 | 42900 | 85.1 | 1.04 |
94 | 3 | MA0707.2 | MNX1 | MA0707.2 | VYRATTAK | MNX1 | 5.13e-52 | 2.56e-54 | 19100 | 52.5 | 23800 | 47.2 | 1.11 |
95 | 3 | MA0709.1 | Msx3 | MA0709.1 | SCAATTAN | Msx3 | 6.43e-52 | 3.18e-54 | 28800 | 79.1 | 37600 | 74.6 | 1.06 |
96 | 3 | MA0885.2 | Dlx2 | MA0885.2 | NDGCAATTAGNN | Dlx2 | 1.54e-51 | 7.53e-54 | 32400 | 89.1 | 43100 | 85.6 | 1.04 |
97 | 3 | MA0708.2 | MSX2 | MA0708.2 | SCAATTAV | MSX2 | 1.91e-51 | 9.23e-54 | 27900 | 76.6 | 36200 | 71.9 | 1.06 |
98 | 3 | MA0601.1 | Arid3b | MA0601.1 | ATATTAATWAN | Arid3b | 4.98e-51 | 2.38e-53 | 21600 | 59.2 | 27200 | 54 | 1.1 |
99 | 3 | MA0658.1 | LHX6 | MA0658.1 | RCTAATTARY | LHX6 | 7.17e-51 | 3.4e-53 | 22500 | 61.8 | 28500 | 56.6 | 1.09 |
100 | 10 | MA0036.3 | GATA2 | MA0036.3 | NBCTTATCTNH | GATA2 | 1.37e-50 | 6.42e-53 | 30400 | 83.6 | 40000 | 79.5 | 1.05 |
101 | 15 | MA0791.1 | POU4F3 | MA0791.1 | RTGMATWATTAATGAR | POU4F3 | 1.76e-50 | 8.16e-53 | 21800 | 60 | 27600 | 54.8 | 1.1 |
102 | 3 | MA0716.1 | PRRX1 | MA0716.1 | YYAATTAR | PRRX1 | 2.63e-50 | 1.21e-52 | 33900 | 93.2 | 45500 | 90.3 | 1.03 |
103 | 3 | MA0701.2 | LHX9 | MA0701.2 | CYAATTAR | LHX9 | 3.9e-50 | 1.78e-52 | 15200 | 41.7 | 18400 | 36.6 | 1.14 |
104 | 3 | MA0700.2 | LHX2 | MA0700.2 | DDCCAATTADN | LHX2 | 5.47e-50 | 2.47e-52 | 28400 | 78 | 37000 | 73.5 | 1.06 |
105 | 8 | MA0792.1 | POU5F1B | MA0792.1 | TATGCWAAT | POU5F1B | 5.64e-50 | 2.52e-52 | 31900 | 87.8 | 42300 | 84.1 | 1.04 |
106 | 16 | MA0018.4 (CREB1) | STREME-3 | 3-ATGWCAT | ATGWCAT | CREB1 | 8.62e-50 | 3.82e-52 | 34600 | 94.9 | 46500 | 92.4 | 1.03 |
107 | 11 | MA1606.1 | Foxf1 | MA1606.1 | DWRTAAACANN | Foxf1 | 8.8e-50 | 3.86e-52 | 32600 | 89.6 | 43400 | 86.2 | 1.04 |
108 | 3 | MA0648.1 | GSC | MA0648.1 | VYTAATCCSY | GSC | 1.33e-49 | 5.77e-52 | 28900 | 79.3 | 37700 | 74.9 | 1.06 |
109 | 3 | MA0666.2 | MSX1 | MA0666.2 | SCAATTAR | MSX1 | 2.24e-49 | 9.64e-52 | 24100 | 66.3 | 30800 | 61.2 | 1.08 |
110 | 17 | MA0495.3 | MAFF | MA0495.3 | DWGTCAGCATTTTWHN | MAFF | 2.78e-49 | 1.18e-51 | 31100 | 85.5 | 41100 | 81.6 | 1.05 |
111 | 3 | MA0910.2 | HOXD8 | MA0910.2 | GYMATTAM | HOXD8 | 6.8e-49 | 2.87e-51 | 17200 | 47.4 | 21300 | 42.2 | 1.12 |
112 | 15 | MA0790.1 | POU4F1 | MA0790.1 | ATGMATAATTAATG | POU4F1 | 1.33e-48 | 5.57e-51 | 22100 | 60.7 | 28000 | 55.6 | 1.09 |
113 | 11 | MA0032.2 | FOXC1 | MA0032.2 | WAWGTAAAYAW | FOXC1 | 1.68e-47 | 6.96e-50 | 29200 | 80.3 | 38300 | 76.1 | 1.06 |
114 | 3 | MA1497.1 | HOXA6 | MA1497.1 | BGYMATTANN | HOXA6 | 1.7e-47 | 6.98e-50 | 31100 | 85.4 | 41100 | 81.6 | 1.05 |
115 | 8 | MA0627.2 | POU2F3 | MA0627.2 | NWTATGCAAATNH | POU2F3 | 1.92e-47 | 7.85e-50 | 26700 | 73.2 | 34500 | 68.6 | 1.07 |
116 | 3 | MA0681.2 | PHOX2B | MA0681.2 | WWTAATCAAATTAWWW | PHOX2B | 2.46e-47 | 9.95e-50 | 21000 | 57.6 | 26400 | 52.5 | 1.1 |
117 | 3 | MA0630.1 | SHOX | MA0630.1 | HTAATTRR | SHOX | 3.63e-47 | 1.45e-49 | 34300 | 94.3 | 46100 | 91.6 | 1.03 |
118 | 18 | MA0769.2 | TCF7 | MA0769.2 | NNCTTTGAWVT | TCF7 | 3.64e-47 | 1.45e-49 | 34000 | 93.5 | 45700 | 90.8 | 1.03 |
119 | 3 | MA1549.1 | POU6F1 | MA1549.1 | RTAATGAGVN | POU6F1 | 4.91e-47 | 1.93e-49 | 30100 | 82.6 | 39600 | 78.6 | 1.05 |
120 | 3 | MA0661.1 | MEOX1 | MA0661.1 | VSTAATTAHC | MEOX1 | 5.54e-47 | 2.17e-49 | 31100 | 85.4 | 41100 | 81.6 | 1.05 |
121 | 10 | MA1970.1 | TRPS1 | MA1970.1 | NHTCTTATCTNH | TRPS1 | 5.83e-47 | 2.26e-49 | 30000 | 82.4 | 39500 | 78.4 | 1.05 |
122 | 3 | MA1518.2 | Lhx1 | MA1518.2 | NNNTGCTAATTAGCANNN | Lhx1 | 5.87e-47 | 2.26e-49 | 27300 | 74.9 | 35400 | 70.3 | 1.06 |
123 | 3 | MA0068.2 | PAX4 | MA0068.2 | CTAATTAG | PAX4 | 7.26e-47 | 2.77e-49 | 30900 | 84.9 | 40800 | 81.1 | 1.05 |
124 | 3 | MA1471.1 | BARX2 | MA1471.1 | NWWAAYMATTAN | BARX2 | 8.49e-47 | 3.21e-49 | 28000 | 77 | 36500 | 72.5 | 1.06 |
125 | 3 | MA0912.2 | HOXD3 | MA0912.2 | CTAATTAC | HOXD3 | 3.33e-46 | 1.25e-48 | 13700 | 37.5 | 16500 | 32.7 | 1.15 |
126 | 19 | MA0905.1 | HOXC10 | MA0905.1 | GTCRTAAAAH | HOXC10 | 4.32e-46 | 1.61e-48 | 24500 | 67.3 | 31400 | 62.5 | 1.08 |
127 | 3 | MA0894.1 | HESX1 | MA0894.1 | GHTAATTRRH | HESX1 | 1.53e-45 | 5.65e-48 | 22600 | 62.2 | 28800 | 57.3 | 1.09 |
128 | 3 | MA0706.1 | MEOX2 | MA0706.1 | DSTAATTAWN | MEOX2 | 6.1e-45 | 2.24e-47 | 31800 | 87.4 | 42200 | 83.9 | 1.04 |
129 | 4 | MA0143.4 | SOX2 | MA0143.4 | DNACAATGGNN | SOX2 | 1.26e-44 | 4.59e-47 | 30600 | 84 | 40400 | 80.2 | 1.05 |
130 | 3 | MA0642.2 | EN2 | MA0642.2 | VSYAATTA | EN2 | 3.87e-44 | 1.4e-46 | 28800 | 79.1 | 37700 | 74.9 | 1.06 |
131 | 3 | MA1519.1 | LHX5 | MA1519.1 | STAATTAV | LHX5 | 7.61e-44 | 2.73e-46 | 14500 | 39.8 | 17600 | 35 | 1.14 |
132 | 20 | MA1657.1 | ZNF652 | MA1657.1 | BAAAGRGTTAAN | ZNF652 | 8.5e-44 | 3.02e-46 | 30300 | 83.3 | 40000 | 79.4 | 1.05 |
133 | 21 | MA0122.3 | Nkx3-2 | MA0122.3 | WWAAMCACTTAAN | Nkx3-2 | 9.85e-44 | 3.47e-46 | 27400 | 75.2 | 35600 | 70.8 | 1.06 |
134 | 11 | MA0846.1 | FOXC2 | MA0846.1 | WAWGTAAACAWW | FOXC2 | 1.12e-43 | 3.93e-46 | 30200 | 82.8 | 39700 | 79 | 1.05 |
135 | 11 | MA1978.1 | ZNF354A | MA1978.1 | HWAADWATAAATGRAYWAWTT | ZNF354A | 1.51e-43 | 5.25e-46 | 25100 | 69 | 32400 | 64.3 | 1.07 |
136 | 3 | MA0711.1 | OTX1 | MA0711.1 | HTAATCCS | OTX1 | 2.14e-43 | 7.38e-46 | 31400 | 86.3 | 41600 | 82.7 | 1.04 |
137 | 8 | MA1115.1 | POU5F1 | MA1115.1 | WHATGCAAATN | POU5F1 | 7.01e-43 | 2.4e-45 | 31400 | 86.2 | 41600 | 82.6 | 1.04 |
138 | 3 | MA0903.1 | HOXB3 | MA0903.1 | NNYMATTARN | HOXB3 | 7.34e-43 | 2.5e-45 | 31900 | 87.6 | 42400 | 84.2 | 1.04 |
139 | 22 | MA0047.3 | FOXA2 | MA0047.3 | RWGTAAACANN | FOXA2 | 3e-42 | 1.01e-44 | 32000 | 87.8 | 42500 | 84.4 | 1.04 |
140 | 23 | MA0499.2 (MYOD1) | STREME-5 | 5-AGACAGG | AGACAGG | MYOD1 | 6.66e-42 | 2.23e-44 | 27900 | 76.6 | 36500 | 72.4 | 1.06 |
141 | 3 | MA0892.1 | GSX1 | MA0892.1 | NCYMATTAVN | GSX1 | 8.32e-42 | 2.77e-44 | 30600 | 84 | 40400 | 80.3 | 1.05 |
142 | 3 | MA0705.1 | Lhx8 | MA0705.1 | CTAATTAV | Lhx8 | 1.06e-41 | 3.52e-44 | 19700 | 54.2 | 24900 | 49.4 | 1.1 |
143 | 22 | MA0148.4 | FOXA1 | MA0148.4 | WWGTAAACATNN | FOXA1 | 1.32e-41 | 4.33e-44 | 31500 | 86.5 | 41800 | 83.1 | 1.04 |
144 | 4 | MA1152.1 | SOX15 | MA1152.1 | CYWTTGTTHW | SOX15 | 5e-41 | 1.63e-43 | 33500 | 91.9 | 44900 | 89.1 | 1.03 |
145 | 22 | MA0031.1 | FOXD1 | MA0031.1 | GTAAACAW | FOXD1 | 6.59e-41 | 2.13e-43 | 21900 | 60 | 27900 | 55.4 | 1.09 |
146 | 11 | MA1607.1 | Foxl2 | MA1607.1 | DDWATGTAAACANN | Foxl2 | 7.51e-41 | 2.41e-43 | 31500 | 86.5 | 41800 | 83.1 | 1.04 |
147 | 22 | MA1683.1 | FOXA3 | MA1683.1 | DWGTAAACANN | FOXA3 | 2.5e-40 | 7.99e-43 | 30300 | 83.2 | 40000 | 79.4 | 1.05 |
148 | 3 | MA1530.1 | NKX6-3 | MA1530.1 | VYCATTAWW | NKX6-3 | 4.47e-40 | 1.42e-42 | 19800 | 54.5 | 25100 | 49.8 | 1.09 |
149 | 10 | MA0037.4 | Gata3 | MA0037.4 | NHTCTTATCTNH | Gata3 | 7.08e-40 | 2.23e-42 | 31700 | 87.2 | 42200 | 83.8 | 1.04 |
150 | 4 | MA0077.1 | SOX9 | MA0077.1 | CYATTGTTY | SOX9 | 3.74e-39 | 1.17e-41 | 34900 | 95.8 | 47200 | 93.8 | 1.02 |
151 | 3 | MA0875.1 | BARX1 | MA0875.1 | VCMATTAV | BARX1 | 8.22e-39 | 2.55e-41 | 27900 | 76.7 | 36600 | 72.6 | 1.06 |
152 | 3 | MA0075.3 | PRRX2 | MA0075.3 | CTAATTAN | PRRX2 | 3.82e-38 | 1.18e-40 | 14900 | 40.9 | 18300 | 36.4 | 1.12 |
153 | 24 | MA0108.2 | TBP | MA0108.2 | STATAAAWRSVVVBN | TBP | 6.16e-38 | 1.89e-40 | 29100 | 79.9 | 38300 | 76.1 | 1.05 |
154 | 15 | MA0153.2 | HNF1B | MA0153.2 | GTTAATNATTAAY | HNF1B | 9.18e-38 | 2.8e-40 | 15200 | 41.7 | 18700 | 37.2 | 1.12 |
155 | 3 | MA0612.2 | EMX1 | MA0612.2 | BTAATTAS | EMX1 | 1.17e-37 | 3.55e-40 | 11600 | 31.9 | 14000 | 27.7 | 1.15 |
156 | 25 | MA0895.1 | HMBOX1 | MA0895.1 | MYTAGTTAMS | HMBOX1 | 1.76e-37 | 5.31e-40 | 32400 | 88.9 | 43200 | 85.8 | 1.04 |
157 | 3 | MA0900.2 | HOXA2 | MA0900.2 | STAATTAS | HOXA2 | 2.58e-37 | 7.71e-40 | 14000 | 38.6 | 17200 | 34.2 | 1.13 |
158 | 26 | MA0025.2 | NFIL3 | MA0025.2 | NRTTATGYAATNN | NFIL3 | 3.07e-37 | 9.11e-40 | 28900 | 79.2 | 38000 | 75.4 | 1.05 |
159 | 10 | MA0766.2 | GATA5 | MA0766.2 | SWGATAASRN | GATA5 | 4.31e-37 | 1.27e-39 | 17200 | 47.3 | 21500 | 42.8 | 1.11 |
160 | 3 | MA0717.1 | RAX2 | MA0717.1 | YYAATTAV | RAX2 | 6.66e-37 | 1.95e-39 | 34800 | 95.5 | 47000 | 93.4 | 1.02 |
161 | 3 | MA0891.1 | GSC2 | MA0891.1 | NYTAATCCBH | GSC2 | 9.95e-37 | 2.9e-39 | 33200 | 91.2 | 44500 | 88.5 | 1.03 |
162 | 22 | MA0481.3 | FOXP1 | MA0481.3 | NDGTAAACANH | FOXP1 | 1.22e-36 | 3.52e-39 | 28600 | 78.5 | 37600 | 74.7 | 1.05 |
163 | 3 | MA1463.1 | ARGFX | MA1463.1 | DCTAATTARM | ARGFX | 1.29e-36 | 3.71e-39 | 15900 | 43.6 | 19700 | 39.2 | 1.11 |
164 | 11 | MA0157.3 | Foxo3 | MA0157.3 | NDGTAAACARNN | Foxo3 | 1.68e-36 | 4.81e-39 | 31800 | 87.3 | 42400 | 84.2 | 1.04 |
165 | 13 | MA0682.2 | PITX1 | MA0682.2 | YTAATCCH | PITX1 | 2.44e-36 | 6.93e-39 | 19000 | 52 | 23900 | 47.6 | 1.09 |
166 | 15 | MA0046.2 | HNF1A | MA0046.2 | DRTTAATNATTAACN | HNF1A | 2.45e-36 | 6.93e-39 | 15500 | 42.5 | 19200 | 38.1 | 1.12 |
167 | 4 | MA1563.2 | SOX18 | MA1563.2 | AACAATDV | SOX18 | 3.7e-36 | 1.04e-38 | 27600 | 75.9 | 36200 | 72 | 1.05 |
168 | 27 | MA0602.1 | Arid5a | MA0602.1 | SYAATATTGVDANH | Arid5a | 4.41e-36 | 1.23e-38 | 22800 | 62.6 | 29300 | 58.2 | 1.08 |
169 | 3 | MA1608.1 | Isl1 | MA1608.1 | NNCCATTAGNN | Isl1 | 5.1e-36 | 1.42e-38 | 33700 | 92.7 | 45400 | 90.2 | 1.03 |
170 | 3 | MA0898.1 | Hmx3 | MA0898.1 | ASAAGCAATTAAWRRAT | Hmx3 | 6.69e-36 | 1.85e-38 | 21600 | 59.4 | 27700 | 55 | 1.08 |
171 | 12 | MA0497.1 | MEF2C | MA0497.1 | DDDCYAAAAATAGMW | MEF2C | 7.79e-36 | 2.14e-38 | 22900 | 62.9 | 29400 | 58.5 | 1.07 |
172 | 28 | MA0809.2 | TEAD4 | MA0809.2 | NNACATTCCARN | TEAD4 | 9.33e-36 | 2.55e-38 | 27600 | 75.9 | 36200 | 72 | 1.05 |
173 | 22 | MA0033.2 | FOXL1 | MA0033.2 | RTAAACA | FOXL1 | 1.15e-35 | 3.12e-38 | 29600 | 81.4 | 39200 | 77.8 | 1.05 |
174 | 4 | MA0868.2 | SOX8 | MA0868.2 | MGAACAATRG | SOX8 | 1.22e-35 | 3.3e-38 | 33200 | 91.2 | 44500 | 88.4 | 1.03 |
175 | 13 | MA0712.2 | OTX2 | MA0712.2 | NDRGGATTARNN | OTX2 | 1.48e-35 | 3.97e-38 | 26500 | 72.8 | 34600 | 68.7 | 1.06 |
176 | 3 | MA0662.1 | MIXL1 | MA0662.1 | NBYAATTAVN | MIXL1 | 3.59e-35 | 9.58e-38 | 34600 | 95.1 | 46800 | 93 | 1.02 |
177 | 4 | MA1120.1 | SOX13 | MA1120.1 | DVACAATGGNN | SOX13 | 1.29e-34 | 3.43e-37 | 30600 | 84.1 | 40700 | 80.8 | 1.04 |
178 | 9 | MA0909.3 | Hoxd13 | MA0909.3 | NNCAATAAAW | Hoxd13 | 2.39e-34 | 6.31e-37 | 22100 | 60.7 | 28400 | 56.4 | 1.08 |
179 | 29 | 7-AGAATA | STREME-7 | 7-AGAATA | AGAATA | AGAATA | 5e-34 | 1.31e-36 | 32800 | 90.1 | 44000 | 87.4 | 1.03 |
180 | 22 | MA0042.2 | FOXI1 | MA0042.2 | GTAAACA | FOXI1 | 5.6e-34 | 1.46e-36 | 26200 | 72 | 34300 | 68 | 1.06 |
181 | 30 | MA0018.4 | CREB1 | MA0018.4 | NNRTGATGTCAYN | CREB1 | 1.09e-33 | 2.83e-36 | 28100 | 77.1 | 36900 | 73.3 | 1.05 |
182 | 22 | MA1103.2 | FOXK2 | MA1103.2 | DDGTAAACANN | FOXK2 | 1.84e-33 | 4.75e-36 | 34000 | 93.4 | 45900 | 91.1 | 1.03 |
183 | 3 | MA0888.1 | EVX2 | MA0888.1 | VVTAATTAVB | EVX2 | 2.16e-33 | 5.53e-36 | 33000 | 90.5 | 44200 | 87.8 | 1.03 |
184 | 31 | MA1125.1 (ZNF384) | MEME-1 | THTTTTTKTWTTTTT | THTTTTTKTWTTTTT | ZNF384 | 3.33e-33 | 8.48e-36 | 22600 | 62 | 29100 | 57.8 | 1.07 |
185 | 4 | MA0514.2 | Sox3 | MA0514.2 | DNACAATGGNN | Sox3 | 4.04e-33 | 1.02e-35 | 32100 | 88.2 | 43000 | 85.3 | 1.03 |
186 | 22 | MA0848.1 | FOXO4 | MA0848.1 | GTAAACA | FOXO4 | 4.17e-33 | 1.05e-35 | 31300 | 86 | 41700 | 82.8 | 1.04 |
187 | 4 | MA0078.2 | Sox17 | MA0078.2 | NNAGAACAATGGNN | Sox17 | 8.89e-33 | 2.23e-35 | 30900 | 84.9 | 41100 | 81.7 | 1.04 |
188 | 26 | MA0102.4 | CEBPA | MA0102.4 | NDATTGCACAATHN | CEBPA | 1.04e-32 | 2.59e-35 | 30000 | 82.3 | 39700 | 79 | 1.04 |
189 | 3 | MA0881.1 | Dlx4 | MA0881.1 | VYAATTAN | Dlx4 | 1.06e-32 | 2.64e-35 | 35200 | 96.6 | 47800 | 94.9 | 1.02 |
190 | 3 | MA0914.1 | ISL2 | MA0914.1 | GCAMTTAR | ISL2 | 1.28e-32 | 3.15e-35 | 30500 | 83.7 | 40500 | 80.4 | 1.04 |
191 | 4 | MA1562.1 | SOX14 | MA1562.1 | CCGAACAATG | SOX14 | 1.66e-32 | 4.09e-35 | 27200 | 74.7 | 35700 | 70.9 | 1.05 |
192 | 3 | MA1498.2 | HOXA7 | MA1498.2 | SYMATTAV | HOXA7 | 1.84e-32 | 4.48e-35 | 34300 | 94.3 | 46400 | 92.1 | 1.02 |
193 | 18 | MA0523.1 | TCF7L2 | MA0523.1 | DRASATCAAAGRVA | TCF7L2 | 2.31e-32 | 5.62e-35 | 22800 | 62.6 | 29400 | 58.5 | 1.07 |
194 | 3 | MA0893.2 | GSX2 | MA0893.2 | SYMATTAR | GSX2 | 3.24e-32 | 7.83e-35 | 30700 | 84.4 | 40800 | 81.1 | 1.04 |
195 | 3 | MA0132.2 | PDX1 | MA0132.2 | VYAATTAR | PDX1 | 1.26e-31 | 3.03e-34 | 20100 | 55.2 | 25700 | 51.1 | 1.08 |
196 | 17 | MA0842.2 | NRL | MA0842.2 | AAWWNTGCTGACG | NRL | 3.06e-31 | 7.32e-34 | 14900 | 41 | 18600 | 36.9 | 1.11 |
197 | 13 | MA0467.2 | Crx | MA0467.2 | NDGGATTANN | Crx | 1.31e-30 | 3.12e-33 | 26500 | 72.7 | 34700 | 68.9 | 1.05 |
198 | 3 | MA0854.1 | Alx1 | MA0854.1 | CGMRYTAATTARTNMNY | Alx1 | 1.33e-30 | 3.15e-33 | 15000 | 41.3 | 18800 | 37.3 | 1.11 |
199 | 32 | MA0755.1 | CUX2 | MA0755.1 | TRATCGATAW | CUX2 | 1.55e-30 | 3.65e-33 | 17100 | 46.9 | 21500 | 42.8 | 1.1 |
200 | 32 | MA0679.2 | ONECUT1 | MA0679.2 | WAAAAATCAATAAWWN | ONECUT1 | 1.64e-30 | 3.84e-33 | 14800 | 40.6 | 18400 | 36.6 | 1.11 |
201 | 33 | MA0070.1 | PBX1 | MA0070.1 | HCATCAATCAAW | PBX1 | 5.35e-30 | 1.25e-32 | 15500 | 42.6 | 19400 | 38.6 | 1.1 |
202 | 3 | MA0618.1 | LBX1 | MA0618.1 | TTAATTAG | LBX1 | 5.94e-30 | 1.38e-32 | 19800 | 54.3 | 25300 | 50.2 | 1.08 |
203 | 28 | MA0090.3 | TEAD1 | MA0090.3 | NNACATTCCAGSN | TEAD1 | 1.36e-29 | 3.15e-32 | 29300 | 80.6 | 38900 | 77.2 | 1.04 |
204 | 26 | MA1636.1 | CEBPG | MA1636.1 | NDATGATGCAATMHH | CEBPG | 4.52e-28 | 1.04e-30 | 23100 | 63.6 | 30100 | 59.7 | 1.06 |
205 | 34 | MA0151.1 | Arid3a | MA0151.1 | ATYAAA | Arid3a | 4.71e-28 | 1.08e-30 | 15000 | 41.1 | 18700 | 37.2 | 1.1 |
206 | 1 | MA0591.1 | Bach1::Mafk | MA0591.1 | RSSATGACTCAGCAB | Bach1::Mafk | 6.68e-28 | 1.52e-30 | 2740 | 7.54 | 2820 | 5.59 | 1.35 |
207 | 35 | MA0483.1 | Gfi1B | MA0483.1 | AAATCWCWGCH | Gfi1B | 1.56e-27 | 3.54e-30 | 20100 | 55.3 | 25900 | 51.4 | 1.08 |
208 | 11 | MA0852.2 | FOXK1 | MA0852.2 | NRDGTAAACAAGNN | FOXK1 | 2.71e-27 | 6.11e-30 | 32300 | 88.8 | 43400 | 86.2 | 1.03 |
209 | 9 | MA0878.3 | CDX1 | MA0878.3 | GGYMATAAAA | CDX1 | 3.01e-27 | 6.77e-30 | 14300 | 39.4 | 17900 | 35.6 | 1.11 |
210 | 3 | MA1499.1 | HOXB4 | MA1499.1 | RTMATTAR | HOXB4 | 4.48e-27 | 1e-29 | 15800 | 43.4 | 19900 | 39.6 | 1.1 |
211 | 21 | MA0124.2 | Nkx3-1 | MA0124.2 | RCCACTTAA | Nkx3-1 | 5.92e-27 | 1.32e-29 | 27700 | 76.2 | 36600 | 72.8 | 1.05 |
212 | 26 | MA0836.2 | CEBPD | MA0836.2 | NRTTGCACAAYHN | CEBPD | 9.6e-27 | 2.13e-29 | 28300 | 77.6 | 37400 | 74.3 | 1.04 |
213 | 3 | MA0880.1 | Dlx3 | MA0880.1 | NYAATTAN | Dlx3 | 1.61e-26 | 3.55e-29 | 35200 | 96.8 | 48000 | 95.3 | 1.02 |
214 | 3 | MA0724.1 | VENTX | MA0724.1 | AMCGATTAR | VENTX | 3.14e-26 | 6.89e-29 | 12500 | 34.3 | 15400 | 30.7 | 1.12 |
215 | 17 | MA0117.2 | Mafb | MA0117.2 | AWWNTGCTGACD | Mafb | 4.65e-26 | 1.01e-28 | 32300 | 88.8 | 43400 | 86.2 | 1.03 |
216 | 3 | MA1500.1 | HOXB6 | MA1500.1 | BTAATKRC | HOXB6 | 5.6e-26 | 1.22e-28 | 35300 | 97 | 48100 | 95.5 | 1.02 |
217 | 13 | MA0714.1 | PITX3 | MA0714.1 | NYTAATCCC | PITX3 | 5.89e-26 | 1.27e-28 | 35300 | 96.9 | 48100 | 95.5 | 1.02 |
218 | 3 | MA0853.1 | Alx4 | MA0853.1 | CGCRYTAATTARYHNNY | Alx4 | 1.18e-24 | 2.54e-27 | 13900 | 38.1 | 17400 | 34.5 | 1.1 |
219 | 22 | MA1489.1 | FOXN3 | MA1489.1 | GTAAACAA | FOXN3 | 1.57e-24 | 3.36e-27 | 23100 | 63.4 | 30100 | 59.8 | 1.06 |
220 | 36 | MA1623.1 | Stat2 | MA1623.1 | RGAAACAGAAASH | Stat2 | 2.28e-24 | 4.86e-27 | 32000 | 88 | 43000 | 85.5 | 1.03 |
221 | 3 | MA1577.1 | TLX2 | MA1577.1 | BYAATTAR | TLX2 | 2.41e-24 | 5.11e-27 | 12100 | 33.2 | 15000 | 29.8 | 1.12 |
222 | 37 | MA0611.2 | Dux | MA0611.2 | NTGATTBAATCAVWNN | Dux | 6.11e-24 | 1.29e-26 | 11700 | 32.2 | 14500 | 28.8 | 1.12 |
223 | 18 | MA1991.1 | Hnf1A | MA1991.1 | NBCCTTTGATSTBN | Hnf1A | 2.26e-23 | 4.76e-26 | 35100 | 96.4 | 47800 | 94.9 | 1.02 |
224 | 22 | MA0480.2 | Foxo1 | MA0480.2 | NDGTAAACANN | Foxo1 | 2.66e-23 | 5.58e-26 | 34600 | 95 | 47000 | 93.3 | 1.02 |
225 | 38 | MA0606.2 | Nfat5 | MA0606.2 | NNATGGAAAAWN | Nfat5 | 4.78e-23 | 9.97e-26 | 31300 | 85.9 | 42000 | 83.3 | 1.03 |
226 | 22 | MA0850.1 | FOXP3 | MA0850.1 | RTAAACA | FOXP3 | 7.55e-23 | 1.57e-25 | 33900 | 93.2 | 45900 | 91.2 | 1.02 |
227 | 39 | MA1603.1 | Dmrt1 | MA1603.1 | GHTACAAAGTADC | Dmrt1 | 9.66e-23 | 2e-25 | 22800 | 62.7 | 29800 | 59.2 | 1.06 |
228 | 40 | MA0782.2 | PKNOX1 | MA0782.2 | NNTGAGTGACAGVNN | PKNOX1 | 1.17e-22 | 2.41e-25 | 16400 | 45.2 | 21000 | 41.6 | 1.09 |
229 | 13 | MA0719.1 | RHOXF1 | MA0719.1 | NTRATCCN | RHOXF1 | 1.41e-22 | 2.89e-25 | 30800 | 84.5 | 41200 | 81.8 | 1.03 |
230 | 28 | MA0808.1 | TEAD3 | MA0808.1 | RCATTCCW | TEAD3 | 4.39e-22 | 8.95e-25 | 15400 | 42.2 | 19500 | 38.8 | 1.09 |
231 | 32 | MA0754.2 | CUX1 | MA0754.2 | TAATCGATAM | CUX1 | 5.44e-22 | 1.1e-24 | 16700 | 45.8 | 21300 | 42.3 | 1.08 |
232 | 30 | MA1632.1 | ATF2 | MA1632.1 | NNATGABGTCATN | ATF2 | 1.65e-21 | 3.33e-24 | 24700 | 67.9 | 32500 | 64.6 | 1.05 |
233 | 41 | MA0095.3 | Yy1 | MA0095.3 | NNCAAAATGGCN | Yy1 | 1.74e-21 | 3.5e-24 | 33700 | 92.7 | 45700 | 90.8 | 1.02 |
234 | 3 | MA1504.1 | HOXC4 | MA1504.1 | VTMATTAN | HOXC4 | 3.73e-21 | 7.48e-24 | 17200 | 47.3 | 22100 | 43.9 | 1.08 |
235 | 3 | MA1507.1 | HOXD4 | MA1507.1 | VTMATTAD | HOXD4 | 4.37e-21 | 8.72e-24 | 11500 | 31.6 | 14300 | 28.4 | 1.11 |
236 | 13 | MA0883.1 | Dmbx1 | MA0883.1 | WDRWNMGGATTADKNNW | Dmbx1 | 6.37e-21 | 1.27e-23 | 18500 | 50.8 | 23900 | 47.4 | 1.07 |
237 | 42 | MA1974.1 | ZNF211 | MA1974.1 | HWYATATACCAYRYW | ZNF211 | 7.63e-21 | 1.51e-23 | 16900 | 46.3 | 21600 | 42.9 | 1.08 |
238 | 3 | MA0715.1 | PROP1 | MA0715.1 | TAATYWAATTA | PROP1 | 7.68e-21 | 1.51e-23 | 14200 | 39 | 18000 | 35.7 | 1.09 |
239 | 26 | MA0043.3 | HLF | MA0043.3 | NNRTTATGCAACHN | HLF | 1.18e-20 | 2.32e-23 | 22400 | 61.5 | 29300 | 58.2 | 1.06 |
240 | 3 | MA1496.1 | HOXA4 | MA1496.1 | RTMATTAV | HOXA4 | 1.48e-20 | 2.89e-23 | 15100 | 41.4 | 19200 | 38.1 | 1.09 |
241 | 36 | MA0050.3 | Irf1 | MA0050.3 | ANTGAAACTGAAASH | Irf1 | 2.62e-20 | 5.1e-23 | 26900 | 73.9 | 35700 | 70.8 | 1.04 |
242 | 3 | MA1501.1 | HOXB7 | MA1501.1 | GGTAATTAVS | HOXB7 | 6.07e-20 | 1.18e-22 | 28900 | 79.3 | 38500 | 76.5 | 1.04 |
243 | 26 | MA0833.2 | ATF4 | MA0833.2 | DDATGATGCAATMN | ATF4 | 8.63e-20 | 1.67e-22 | 19400 | 53.3 | 25100 | 49.9 | 1.07 |
244 | 43 | MA1118.1 | SIX1 | MA1118.1 | GWAACCTGAKM | SIX1 | 1.42e-19 | 2.73e-22 | 29900 | 82.2 | 40100 | 79.6 | 1.03 |
245 | 44 | MA0626.1 | Npas2 | MA0626.1 | NSCACGTGTN | Npas2 | 2.71e-19 | 5.2e-22 | 17300 | 47.4 | 22200 | 44.1 | 1.07 |
246 | 40 | MA0774.1 | MEIS2 | MA0774.1 | TTGACAGS | MEIS2 | 7.32e-19 | 1.4e-21 | 5800 | 15.9 | 6850 | 13.6 | 1.17 |
247 | 3 | MA0699.1 | LBX2 | MA0699.1 | NCYAATTARN | LBX2 | 1.03e-18 | 1.96e-21 | 35100 | 96.4 | 47800 | 95 | 1.01 |
248 | 3 | MA1960.1 | MGA::EVX1 | MA1960.1 | RGGTGWTAATKD | MGA::EVX1 | 1.23e-18 | 2.33e-21 | 21200 | 58.2 | 27700 | 54.9 | 1.06 |
249 | 22 | MA0614.1 | Foxj2 | MA0614.1 | RTAAACAA | Foxj2 | 2.09e-18 | 3.95e-21 | 18800 | 51.6 | 24400 | 48.4 | 1.07 |
250 | 30 | MA0498.2 | MEIS1 | MA0498.2 | HTGACAD | MEIS1 | 2.51e-18 | 4.7e-21 | 14800 | 40.7 | 18900 | 37.6 | 1.08 |
251 | 4 | MA0867.2 | SOX4 | MA0867.2 | RAACAAAGRV | SOX4 | 3.7e-18 | 6.91e-21 | 27800 | 76.4 | 37100 | 73.7 | 1.04 |
252 | 19 | MA0911.1 | Hoxa11 | MA0911.1 | RGTCGTAAAANT | Hoxa11 | 4.07e-18 | 7.58e-21 | 17100 | 46.8 | 22000 | 43.6 | 1.07 |
253 | 10 | MA0482.2 | GATA4 | MA0482.2 | NNCCTTATCTNH | GATA4 | 5.32e-18 | 9.86e-21 | 10900 | 30.1 | 13700 | 27.2 | 1.11 |
254 | 45 | MA0144.2 | STAT3 | MA0144.2 | YTTCYKGGAAD | STAT3 | 7.51e-18 | 1.39e-20 | 22700 | 62.5 | 29900 | 59.4 | 1.05 |
255 | 30 | MA1131.1 | FOSL2::JUN | MA1131.1 | GRTGACGTMAT | FOSL2::JUN | 1.43e-17 | 2.62e-20 | 24900 | 68.5 | 33000 | 65.5 | 1.05 |
256 | 46 | MA1479.1 | DMRTC2 | MA1479.1 | AATTGMTACATT | DMRTC2 | 1.7e-17 | 3.12e-20 | 26900 | 73.9 | 35800 | 71.1 | 1.04 |
257 | 11 | MA1125.1 | ZNF384 | MA1125.1 | DNWMAAAAAAAA | ZNF384 | 3.65e-17 | 6.66e-20 | 17200 | 47.2 | 22200 | 44 | 1.07 |
258 | 4 | MA0087.2 | Sox5 | MA0087.2 | VRVAACAATGGNN | Sox5 | 4.41e-17 | 8.02e-20 | 26400 | 72.5 | 35100 | 69.7 | 1.04 |
259 | 3 | MA0874.1 | Arx | MA0874.1 | VTBCRYTAATTARTKSW | Arx | 4.91e-17 | 8.9e-20 | 12400 | 34 | 15700 | 31.1 | 1.09 |
260 | 47 | MA1150.1 | RORB | MA1150.1 | AWWTRGGTCAH | RORB | 2.2e-16 | 3.98e-19 | 34900 | 95.8 | 47500 | 94.4 | 1.01 |
261 | 11 | MA0851.1 | Foxj3 | MA0851.1 | NDAADGTAAACAAANNM | Foxj3 | 2.27e-16 | 4.07e-19 | 17500 | 48 | 22600 | 44.9 | 1.07 |
262 | 3 | MA0887.1 | EVX1 | MA0887.1 | VVTAATTABS | EVX1 | 3.44e-16 | 6.16e-19 | 34800 | 95.6 | 47400 | 94.2 | 1.01 |
263 | 8 | MA1962.1 | POU2F1::SOX2 | MA1962.1 | MATTTRCATMACAATRG | POU2F1::SOX2 | 3.63e-16 | 6.47e-19 | 10400 | 28.7 | 13100 | 26 | 1.1 |
264 | 11 | MA0847.3 | FOXD2 | MA0847.3 | SYTAARYAAACA | FOXD2 | 3.82e-16 | 6.78e-19 | 20800 | 57.2 | 27300 | 54.2 | 1.06 |
265 | 48 | MA1593.1 | ZNF317 | MA1593.1 | WVACAGCAGAYW | ZNF317 | 1.04e-15 | 1.84e-18 | 32400 | 88.9 | 43800 | 86.9 | 1.02 |
266 | 49 | MA1523.1 | MSANTD3 | MA1523.1 | STVCACTCAC | MSANTD3 | 1.05e-15 | 1.85e-18 | 31000 | 85.1 | 41700 | 82.8 | 1.03 |
267 | 46 | MA1707.1 | DMRTA1 | MA1707.1 | AWTTGWTACAWT | DMRTA1 | 2.03e-15 | 3.56e-18 | 27700 | 76 | 37000 | 73.4 | 1.04 |
268 | 40 | MA1114.1 | PBX3 | MA1114.1 | NNNTGAGTGACAGGBNN | PBX3 | 6.06e-15 | 1.06e-17 | 21000 | 57.7 | 27600 | 54.8 | 1.05 |
269 | 43 | MA1119.1 | SIX2 | MA1119.1 | NNCTGAAACCTGATMY | SIX2 | 7.31e-15 | 1.27e-17 | 31800 | 87.4 | 43000 | 85.4 | 1.02 |
270 | 19 | MA0907.1 | HOXC13 | MA0907.1 | BCTCGTAAAAH | HOXC13 | 7.6e-15 | 1.32e-17 | 16900 | 46.5 | 21900 | 43.6 | 1.07 |
271 | 19 | MA0906.1 | HOXC12 | MA0906.1 | RGTCGTAAAAH | HOXC12 | 2.69e-14 | 4.64e-17 | 12800 | 35 | 16300 | 32.3 | 1.08 |
272 | 10 | MA0140.2 | GATA1::TAL1 | MA0140.2 | MTTATCWSNNNNNHVCAG | GATA1::TAL1 | 2.69e-14 | 4.64e-17 | 10700 | 29.4 | 13500 | 26.8 | 1.1 |
273 | 36 | MA0152.2 | Nfatc2 | MA0152.2 | NNRNAATGGAAANWNH | Nfatc2 | 6.76e-14 | 1.16e-16 | 33800 | 93 | 46000 | 91.4 | 1.02 |
274 | 50 | MA0092.1 | Hand1::Tcf3 | MA0092.1 | BRTCTGGMWT | Hand1::Tcf3 | 7.85e-14 | 1.34e-16 | 18400 | 50.5 | 24000 | 47.6 | 1.06 |
275 | 30 | MA1127.1 | FOSB::JUN | MA1127.1 | GATGACGTCAT | FOSB::JUN | 1.23e-13 | 2.09e-16 | 27000 | 74.2 | 36100 | 71.7 | 1.03 |
276 | 30 | MA0093.3 | USF1 | MA0093.3 | NDGTCATGTGACHN | USF1 | 3.22e-13 | 5.48e-16 | 21200 | 58.2 | 27900 | 55.4 | 1.05 |
277 | 38 | MA0624.2 | Nfatc1 | MA0624.2 | NVATGGAAAAWN | Nfatc1 | 3.29e-13 | 5.58e-16 | 33000 | 90.7 | 44800 | 89 | 1.02 |
278 | 28 | MA1121.1 | TEAD2 | MA1121.1 | NYACATTCCWNNS | TEAD2 | 4.64e-13 | 7.84e-16 | 24300 | 66.8 | 32300 | 64.2 | 1.04 |
279 | 51 | MA1524.2 | Msgn1 | MA1524.2 | VRRRACAAATGGTNNN | Msgn1 | 5.09e-13 | 8.56e-16 | 21200 | 58.3 | 28000 | 55.5 | 1.05 |
280 | 18 | MA0768.2 | Lef1 | MA0768.2 | NNBCCTTTGATSTN | Lef1 | 6.13e-13 | 1.03e-15 | 15800 | 43.4 | 20500 | 40.7 | 1.07 |
281 | 3 | MA0896.1 | Hmx1 | MA0896.1 | VSVAGCAATTAAWGVDB | Hmx1 | 2.4e-12 | 4.01e-15 | 9450 | 26 | 11900 | 23.6 | 1.1 |
282 | 30 | MA1112.2 | NR4A1 | MA1112.2 | NNAAAGGTCANN | NR4A1 | 2.55e-12 | 4.24e-15 | 23700 | 65 | 31400 | 62.4 | 1.04 |
283 | 9 | MA1645.1 | NKX2-2 | MA1645.1 | NNNCCACTCAANNN | NKX2-2 | 2.57e-12 | 4.26e-15 | 33100 | 90.9 | 45000 | 89.3 | 1.02 |
284 | 52 | MA1588.1 | ZNF136 | MA1588.1 | RWATTCTTGGTTGRC | ZNF136 | 8.44e-12 | 1.39e-14 | 11400 | 31.4 | 14600 | 29 | 1.08 |
285 | 4 | MA0442.2 | SOX10 | MA0442.2 | NDAACAAAGVN | SOX10 | 1.88e-11 | 3.09e-14 | 18300 | 50.1 | 23900 | 47.6 | 1.05 |
286 | 30 | MA1139.1 | FOSL2::JUNB | MA1139.1 | DATGACGTCATH | FOSL2::JUNB | 4.63e-11 | 7.6e-14 | 19700 | 54.2 | 26000 | 51.7 | 1.05 |
287 | 36 | MA1953.1 | FOXO1::ELF1 | MA1953.1 | RTMAACAGGAAGTN | FOXO1::ELF1 | 5.4e-11 | 8.82e-14 | 31900 | 87.6 | 43300 | 85.9 | 1.02 |
288 | 22 | MA0849.1 | FOXO6 | MA0849.1 | GTAAACA | FOXO6 | 6.09e-11 | 9.93e-14 | 2880 | 7.9 | 3310 | 6.58 | 1.2 |
289 | 21 | MA0063.2 | NKX2-5 | MA0063.2 | NNCACTCAANN | NKX2-5 | 8.29e-11 | 1.35e-13 | 35500 | 97.4 | 48600 | 96.5 | 1.01 |
290 | 37 | MA0884.2 | DUXA | MA0884.2 | YTRATTWAATYAR | DUXA | 8.61e-11 | 1.39e-13 | 5960 | 16.4 | 7320 | 14.6 | 1.13 |
291 | 53 | MA0596.1 | SREBF2 | MA0596.1 | RTGGGGTGAY | SREBF2 | 8.75e-11 | 1.41e-13 | 7110 | 19.5 | 8840 | 17.6 | 1.11 |
292 | 12 | MA0773.1 | MEF2D | MA0773.1 | DCTAWAAATAGM | MEF2D | 1.07e-10 | 1.71e-13 | 4610 | 12.7 | 5560 | 11 | 1.15 |
293 | 54 | MA1975.1 | ZNF214 | MA1975.1 | VWTCATCAABGTCCT | ZNF214 | 1.08e-10 | 1.74e-13 | 22500 | 61.7 | 29800 | 59.2 | 1.04 |
294 | 45 | MA1624.1 | Stat5a | MA1624.1 | NTTCCAAGAANH | Stat5a | 1.26e-10 | 2.01e-13 | 28900 | 79.4 | 39000 | 77.4 | 1.03 |
295 | 30 | MA1111.1 | NR2F2 | MA1111.1 | NAAAGGTCANR | NR2F2 | 1.56e-10 | 2.47e-13 | 31200 | 85.8 | 42300 | 84 | 1.02 |
296 | 35 | MA0038.2 | GFI1 | MA0038.2 | BMAATCACDGCA | GFI1 | 2.1e-10 | 3.33e-13 | 6610 | 18.2 | 8200 | 16.3 | 1.12 |
297 | 15 | MA0683.1 | POU4F2 | MA0683.1 | RTGCATAATTAATGAG | POU4F2 | 4.32e-10 | 6.83e-13 | 6170 | 17 | 7630 | 15.2 | 1.12 |
298 | 51 | MA0620.3 | MITF | MA0620.3 | NNNDRTCACGTGAYHNNN | MITF | 9.29e-10 | 1.46e-12 | 18500 | 50.9 | 24400 | 48.5 | 1.05 |
299 | 55 | MA0508.3 | PRDM1 | MA0508.3 | YNCTTTCTCTH | PRDM1 | 1.72e-09 | 2.69e-12 | 28400 | 77.9 | 38200 | 75.9 | 1.03 |
300 | 9 | MA1644.1 | NFYC | MA1644.1 | RDCCAATCASN | NFYC | 2.14e-09 | 3.35e-12 | 7840 | 21.5 | 9870 | 19.6 | 1.1 |
301 | 19 | MA0908.1 | HOXD11 | MA0908.1 | RTCGTAAAAH | HOXD11 | 2.17e-09 | 3.39e-12 | 4940 | 13.6 | 6030 | 12 | 1.13 |
302 | 36 | MA1709.1 | ZIM3 | MA1709.1 | RVMAACAGAAACCYM | ZIM3 | 2.48e-09 | 3.85e-12 | 19900 | 54.6 | 26300 | 52.3 | 1.05 |
303 | 30 | MA0488.1 | JUN | MA0488.1 | DDRATGATGTCAT | JUN | 2.75e-09 | 4.26e-12 | 6060 | 16.6 | 7510 | 14.9 | 1.12 |
304 | 37 | MA1720.1 | ZNF85 | MA1720.1 | DHDGAGATTACWKCAK | ZNF85 | 2.84e-09 | 4.38e-12 | 22100 | 60.8 | 29400 | 58.4 | 1.04 |
305 | 46 | MA1478.1 | DMRTA2 | MA1478.1 | AATTGHTACAWT | DMRTA2 | 3.47e-09 | 5.34e-12 | 32300 | 88.6 | 43800 | 87.1 | 1.02 |
306 | 51 | MA0698.1 | ZBTB18 | MA0698.1 | NATCCAGATGTKB | ZBTB18 | 4.5e-09 | 6.9e-12 | 12300 | 33.9 | 16000 | 31.7 | 1.07 |
307 | 4 | MA0515.1 | Sox6 | MA0515.1 | CCWTTGTYYY | Sox6 | 6.64e-09 | 1.01e-11 | 4270 | 11.7 | 5180 | 10.3 | 1.14 |
308 | 56 | MA0479.1 | FOXH1 | MA0479.1 | BNSAATCCACA | FOXH1 | 9.19e-09 | 1.4e-11 | 3880 | 10.6 | 4670 | 9.27 | 1.15 |
309 | 30 | MA1143.1 | FOSL1::JUND | MA1143.1 | RTGACGTMAY | FOSL1::JUND | 1.46e-08 | 2.22e-11 | 33100 | 91 | 45200 | 89.7 | 1.02 |
310 | 47 | MA1151.1 | RORC | MA1151.1 | NWAWNTRGGTCA | RORC | 1.61e-08 | 2.43e-11 | 33100 | 91 | 45100 | 89.6 | 1.02 |
311 | 32 | MA0757.1 | ONECUT3 | MA0757.1 | AAAAAATCRATAAH | ONECUT3 | 1.63e-08 | 2.46e-11 | 10200 | 28 | 13100 | 26 | 1.08 |
312 | 21 | MA1994.1 | Nkx2-1 | MA1994.1 | NVCACTTGARHNN | Nkx2-1 | 2.68e-08 | 4.02e-11 | 34500 | 94.8 | 47200 | 93.8 | 1.01 |
313 | 57 | MA0161.2 (NFIC) | STREME-4 | 4-TGCCTGGCACA | TGCCTGGCACA | NFIC | 2.84e-08 | 4.26e-11 | 267 | 0.73 | 201 | 0.4 | 1.83 |
314 | 43 | MA2001.1 | Six4 | MA2001.1 | GDAACCTGANH | Six4 | 3.97e-08 | 5.94e-11 | 35400 | 97.3 | 48600 | 96.5 | 1.01 |
315 | 30 | MA0631.1 | Six3 | MA0631.1 | DATRGGGTATCAYNWNT | Six3 | 5.63e-08 | 8.39e-11 | 14700 | 40.4 | 19200 | 38.2 | 1.06 |
316 | 58 | MA0691.1 | TFAP4 | MA0691.1 | AWCAGCTGWT | TFAP4 | 1.14e-07 | 1.7e-10 | 6440 | 17.7 | 8090 | 16.1 | 1.1 |
317 | 59 | MA1643.1 | NFIB | MA1643.1 | NNCCTGGCAYNGTGCCAAGNN | NFIB | 1.6e-07 | 2.36e-10 | 1700 | 4.66 | 1910 | 3.79 | 1.23 |
318 | 60 | MA1561.1 | SOX12 | MA1561.1 | ACCGAACAATV | SOX12 | 2.12e-07 | 3.12e-10 | 25900 | 71.2 | 34800 | 69.2 | 1.03 |
319 | 19 | MA1503.1 | HOXB9 | MA1503.1 | GTCGTAAAAH | HOXB9 | 2.19e-07 | 3.22e-10 | 2290 | 6.29 | 2660 | 5.29 | 1.19 |
320 | 22 | MA0040.1 | Foxq1 | MA0040.1 | HATTGTTTATW | Foxq1 | 2.29e-07 | 3.36e-10 | 6320 | 17.4 | 7940 | 15.8 | 1.1 |
321 | 61 | MA0060.3 | NFYA | MA0060.3 | RRCCAATCAGM | NFYA | 2.62e-07 | 3.83e-10 | 6040 | 16.6 | 7570 | 15 | 1.1 |
322 | 3 | MA0468.1 | DUX4 | MA0468.1 | TAAYYYAATCA | DUX4 | 3.67e-07 | 5.35e-10 | 6020 | 16.5 | 7550 | 15 | 1.1 |
323 | 61 | MA0502.2 | NFYB | MA0502.2 | CYCATTGGCCVV | NFYB | 4.22e-07 | 6.13e-10 | 1390 | 3.83 | 1540 | 3.06 | 1.25 |
324 | 51 | MA0678.1 | OLIG2 | MA0678.1 | AMCATATGKT | OLIG2 | 5.4e-07 | 7.83e-10 | 20700 | 56.8 | 27500 | 54.7 | 1.04 |
325 | 40 | MA0775.1 | MEIS3 | MA0775.1 | DTGACAGS | MEIS3 | 5.54e-07 | 8e-10 | 1800 | 4.94 | 2050 | 4.08 | 1.21 |
326 | 62 | MA1980.1 | ZNF418 | MA1980.1 | MDGAGGCTAAAAGCA | ZNF418 | 6.14e-07 | 8.84e-10 | 20000 | 54.9 | 26600 | 52.8 | 1.04 |
327 | 51 | MA0669.1 | NEUROG2 | MA0669.1 | RACATATGTC | NEUROG2 | 7.73e-07 | 1.11e-09 | 13200 | 36.2 | 17200 | 34.2 | 1.06 |
328 | 3 | MA0897.1 | Hmx2 | MA0897.1 | ASAAGCAATTAAHVVRT | Hmx2 | 7.85e-07 | 1.12e-09 | 7270 | 20 | 9240 | 18.4 | 1.09 |
329 | 9 | MA1473.1 | CDX4 | MA1473.1 | GGYMATAAAAC | CDX4 | 9.45e-07 | 1.35e-09 | 2960 | 8.12 | 3540 | 7.03 | 1.15 |
330 | 22 | MA0593.1 | FOXP2 | MA0593.1 | RWGTAAACAVR | FOXP2 | 1.48e-06 | 2.11e-09 | 8660 | 23.8 | 11100 | 22.1 | 1.08 |
331 | 19 | MA1506.1 | HOXD10 | MA1506.1 | RGTCGTAAAAH | HOXD10 | 1.69e-06 | 2.39e-09 | 5450 | 15 | 6820 | 13.6 | 1.1 |
332 | 30 | MA0693.3 | Vdr | MA0693.3 | NNGAGTTCANN | Vdr | 2.23e-06 | 3.16e-09 | 35500 | 97.6 | 48800 | 97 | 1.01 |
333 | 51 | MA0461.2 | Atoh1 | MA0461.2 | RMCATATGBY | Atoh1 | 2.56e-06 | 3.6e-09 | 12600 | 34.7 | 16500 | 32.8 | 1.06 |
334 | 28 | MA0114.4 | HNF4A | MA0114.4 | DNCAAAGTCCANN | HNF4A | 2.77e-06 | 3.89e-09 | 35100 | 96.5 | 48200 | 95.7 | 1.01 |
335 | 53 | MA0595.1 | SREBF1 | MA0595.1 | VTCACCCCAY | SREBF1 | 2.9e-06 | 4.07e-09 | 7500 | 20.6 | 9570 | 19 | 1.08 |
336 | 59 | MA0119.1 | NFIC::TLX1 | MA0119.1 | TGGCASSRWGCCAA | NFIC::TLX1 | 4.52e-06 | 6.32e-09 | 1570 | 4.32 | 1790 | 3.56 | 1.22 |
337 | 63 | MA1124.1 | ZNF24 | MA1124.1 | CATTCATTCATTC | ZNF24 | 7.2e-06 | 1e-08 | 3650 | 10 | 4470 | 8.88 | 1.13 |
338 | 28 | MA0484.2 | HNF4G | MA0484.2 | DNCAAAGTCCANN | HNF4G | 9.22e-06 | 1.28e-08 | 35100 | 96.4 | 48200 | 95.6 | 1.01 |
339 | 3 | MA0135.1 | Lhx3 | MA0135.1 | WAATTAATTAWYY | Lhx3 | 1.17e-05 | 1.62e-08 | 6460 | 17.7 | 8200 | 16.3 | 1.09 |
340 | 10 | MA0029.1 | Mecom | MA0029.1 | AAGAYAAGATAANA | Mecom | 1.38e-05 | 1.9e-08 | 4860 | 13.3 | 6080 | 12.1 | 1.11 |
341 | 18 | MA0869.2 | Sox11 | MA0869.2 | NRGAACAAAGVV | Sox11 | 1.4e-05 | 1.93e-08 | 16700 | 45.8 | 22100 | 43.9 | 1.04 |
342 | 30 | MA1145.1 | FOSL2::JUND | MA1145.1 | NNRTGACGTCAYHSN | FOSL2::JUND | 1.82e-05 | 2.5e-08 | 25200 | 69.2 | 33900 | 67.4 | 1.03 |
343 | 30 | MA0492.1 | JUND | MA0492.1 | DDDRATGABGTCATN | JUND | 2.65e-05 | 3.62e-08 | 6370 | 17.5 | 8100 | 16.1 | 1.09 |
344 | 45 | MA0518.1 | Stat4 | MA0518.1 | YTTCYRGGAARNVR | Stat4 | 3.16e-05 | 4.31e-08 | 19000 | 52.2 | 25300 | 50.3 | 1.04 |
345 | 64 | MA0797.1 | TGIF2 | MA0797.1 | TGACAGSTGTCA | TGIF2 | 5.35e-05 | 7.27e-08 | 3690 | 10.1 | 4560 | 9.05 | 1.12 |
346 | 30 | MA0829.2 | SREBF1 | MA0829.2 | DNATCACSTGATNH | SREBF1 | 6.05e-05 | 8.21e-08 | 17000 | 46.8 | 22600 | 45 | 1.04 |
347 | 30 | MA1133.1 | JUN::JUNB | MA1133.1 | KRTGACGTCATN | JUN::JUNB | 8.03e-05 | 1.09e-07 | 22300 | 61.2 | 29900 | 59.5 | 1.03 |
348 | 59 | MA1527.1 | NFIC | MA1527.1 | NTTGGCDNNRTGCCARN | NFIC | 9.65e-05 | 1.3e-07 | 1700 | 4.67 | 1990 | 3.95 | 1.18 |
349 | 30 | MA1136.1 | FOSB::JUNB | MA1136.1 | RTGACGTCAT | FOSB::JUNB | 0.000105 | 1.41e-07 | 23400 | 64.2 | 31400 | 62.5 | 1.03 |
350 | 47 | MA0494.1 | Nr1h3::Rxra | MA0494.1 | TGACCTNNAGTRACCYYDN | Nr1h3::Rxra | 0.000113 | 1.52e-07 | 25100 | 68.9 | 33800 | 67.2 | 1.02 |
351 | 65 | MA1989.1 | Bcl11B | MA1989.1 | NNAAACCACAARNN | Bcl11B | 0.000116 | 1.56e-07 | 27100 | 74.5 | 36700 | 72.9 | 1.02 |
352 | 36 | MA1508.1 | IKZF1 | MA1508.1 | VVAACAGGAARN | IKZF1 | 0.000132 | 1.76e-07 | 31300 | 86 | 42700 | 84.7 | 1.01 |
353 | 9 | MA0913.2 | HOXD9 | MA0913.2 | GYMATAAAAH | HOXD9 | 0.00014 | 1.87e-07 | 3060 | 8.42 | 3760 | 7.47 | 1.13 |
354 | 30 | MA1126.1 | FOS::JUN | MA1126.1 | DRTGACGTCATHNDTN | FOS::JUN | 0.000164 | 2.17e-07 | 33200 | 91.3 | 45400 | 90.3 | 1.01 |
355 | 66 | MA0106.3 | TP53 | MA0106.3 | RACATGYCCGGRCATGTY | TP53 | 0.000165 | 2.18e-07 | 278 | 0.76 | 246 | 0.49 | 1.56 |
356 | 67 | MA0101.1 | REL | MA0101.1 | BGGRNWTTCC | REL | 0.000174 | 2.29e-07 | 32000 | 87.9 | 43700 | 86.7 | 1.01 |
357 | 68 | MA0136.3 | Elf5 | MA0136.3 | RVAAGGAAGTNN | Elf5 | 0.000217 | 2.85e-07 | 35800 | 98.4 | 49300 | 97.9 | 1 |
358 | 30 | MA1129.1 | FOSL1::JUN | MA1129.1 | ATGACGTCAT | FOSL1::JUN | 0.000362 | 4.75e-07 | 23700 | 65.1 | 31900 | 63.4 | 1.03 |
359 | 3 | MA0784.2 | POU1F1 | MA0784.2 | ANCTMATTWGCATAWT | POU1F1 | 0.000631 | 8.24e-07 | 4870 | 13.4 | 6170 | 12.3 | 1.09 |
360 | 36 | MA1952.1 | FOXJ2::ELF1 | MA1952.1 | RTAAACMGGAAGTR | FOXJ2::ELF1 | 0.000663 | 8.63e-07 | 32400 | 88.9 | 44200 | 87.8 | 1.01 |
361 | 51 | MA1618.1 | Ptf1a | MA1618.1 | NNACAGATGTTNN | Ptf1a | 0.00077 | 1e-06 | 34500 | 94.8 | 47400 | 94.1 | 1.01 |
362 | 51 | MA0744.2 | SCRT2 | MA0744.2 | NNWGCAACAGGTGDNN | SCRT2 | 0.000789 | 1.02e-06 | 31800 | 87.4 | 43400 | 86.3 | 1.01 |
363 | 51 | MA0827.1 | OLIG3 | MA0827.1 | AMCATATGBY | OLIG3 | 0.000845 | 1.09e-06 | 24700 | 67.8 | 33300 | 66.2 | 1.02 |
364 | 32 | MA0780.1 | PAX3 | MA0780.1 | TAATYRATTA | PAX3 | 0.000922 | 1.19e-06 | 4810 | 13.2 | 6110 | 12.1 | 1.09 |
13 | 74 | TTGAGSYCAGGAGTT | MEME-6 | TTGAGSYCAGGAGTT | TTGAGSYCAGGAGTT | TTGAGSYCAGGAGTT | 0.0011 | 8.47e-05 | 683 | 1.88 | 773 | 1.54 | 1.22 |
365 | 22 | MA0030.1 | FOXF2 | MA0030.1 | BNAASGTAAACAAD | FOXF2 | 0.00119 | 1.53e-06 | 4580 | 12.6 | 5800 | 11.5 | 1.09 |
366 | 59 | MA1528.1 | NFIX | MA1528.1 | STTGGCRNNGTGCCARB | NFIX | 0.00142 | 1.81e-06 | 2880 | 7.91 | 3560 | 7.07 | 1.12 |
367 | 69 | MA1592.1 | ZNF274 | MA1592.1 | NRTRTGAGTTCTCGYN | ZNF274 | 0.00311 | 3.98e-06 | 7670 | 21.1 | 9980 | 19.8 | 1.06 |
368 | 70 | MA0520.1 | Stat6 | MA0520.1 | BDBTTCCWSAGAAVY | Stat6 | 0.00419 | 5.34e-06 | 10100 | 27.8 | 13300 | 26.5 | 1.05 |
369 | 51 | MA0826.1 | OLIG1 | MA0826.1 | AMCATATGKT | OLIG1 | 0.00524 | 6.67e-06 | 8720 | 24 | 11400 | 22.7 | 1.06 |
370 | 26 | MA0639.1 | DBP | MA0639.1 | NRTTACGTAAYV | DBP | 0.00547 | 6.93e-06 | 6230 | 17.1 | 8050 | 16 | 1.07 |
371 | 22 | MA0613.1 | FOXG1 | MA0613.1 | RTAAACAW | FOXG1 | 0.00572 | 7.23e-06 | 5560 | 15.3 | 7160 | 14.2 | 1.07 |
372 | 30 | MA0592.3 | ESRRA | MA0592.3 | NBTCAAGGTCAHN | ESRRA | 0.00604 | 7.62e-06 | 33000 | 90.6 | 45200 | 89.7 | 1.01 |
373 | 58 | MA0633.2 | Twist2 | MA0633.2 | NVCAGCTGBN | Twist2 | 0.00714 | 8.98e-06 | 5010 | 13.8 | 6420 | 12.8 | 1.08 |
374 | 18 | MA1421.1 | TCF7L1 | MA1421.1 | AAAGATCAAAGG | TCF7L1 | 0.00734 | 9.21e-06 | 3390 | 9.31 | 4260 | 8.47 | 1.1 |
375 | 14 | MA0877.3 | BARHL1 | MA0877.3 | AMCGTTTA | BARHL1 | 0.0129 | 1.61e-05 | 8070 | 22.2 | 10600 | 21 | 1.06 |
376 | 36 | MA0517.1 | STAT1::STAT2 | MA0517.1 | THAGTTTCRKTTTCY | STAT1::STAT2 | 0.0136 | 1.7e-05 | 17200 | 47.1 | 23000 | 45.7 | 1.03 |
377 | 57 | MA0161.2 | NFIC | MA0161.2 | NNCTTGGCANN | NFIC | 0.0164 | 2.04e-05 | 35200 | 96.6 | 48400 | 96.1 | 1.01 |
378 | 11 | MA1487.2 | FOXE1 | MA1487.2 | VYTAAAYAAACAAH | FOXE1 | 0.0174 | 2.17e-05 | 553 | 1.52 | 600 | 1.19 | 1.27 |
379 | 71 | MA0484.2 (HNF4G) | STREME-6 | 6-CAAAGTGCTGGG | CAAAGTGCTGGG | HNF4G | 0.0182 | 2.25e-05 | 301 | 0.83 | 297 | 0.59 | 1.4 |
380 | 30 | MA1996.1 | Nr1H2 | MA1996.1 | NNRAGGTCANN | Nr1H2 | 0.0195 | 2.41e-05 | 34900 | 96 | 48000 | 95.4 | 1.01 |
381 | 30 | MA0629.1 | Rhox11 | MA0629.1 | ANKNCGCTGTWAWDVRW | Rhox11 | 0.0204 | 2.51e-05 | 14800 | 40.7 | 19800 | 39.4 | 1.03 |
382 | 72 | MA0682.2 (PITX1) | MEME-4 | WGCTGGGATTACAGG | WGCTGGGATTACAGG | PITX1 | 0.023 | 2.83e-05 | 900 | 2.47 | 1040 | 2.06 | 1.2 |
383 | 58 | MA0667.1 | MYF6 | MA0667.1 | AACARCTGTT | MYF6 | 0.0357 | 4.37e-05 | 5340 | 14.7 | 6910 | 13.7 | 1.07 |
384 | 19 | MA0485.2 | HOXC9 | MA0485.2 | GTCGTAAAAH | HOXC9 | 0.0361 | 4.41e-05 | 3250 | 8.94 | 4120 | 8.18 | 1.09 |
385 | 45 | MA1625.1 | Stat5b | MA1625.1 | NNNTTCCCAGAANNN | Stat5b | 0.0364 | 4.44e-05 | 20200 | 55.6 | 27300 | 54.3 | 1.02 |
386 | 66 | MA0525.2 | TP63 | MA0525.2 | RACATGTYGKGACATGTC | TP63 | 0.0376 | 4.57e-05 | 275 | 0.76 | 271 | 0.54 | 1.4 |
387 | 73 | MA0069.1 | PAX6 | MA0069.1 | TTCACGCWTSANTK | PAX6 | 0.0385 | 4.66e-05 | 27900 | 76.6 | 38000 | 75.4 | 1.02 |
14 | 75 | MA0090.3 (TEAD1) | MEME-5 | CTGCACTCCAGCCTG | CTGCACTCCAGCCTG | TEAD1 | 0.105 | 0.00752 | 3540 | 9.72 | 4640 | 9.21 | 1.05 |
15 | 76 | MA1596.1 (ZNF460) | MEME-7 | RGAGGCWGAGGSAGG | RGAGGCWGAGGSAGG | ZNF460 | 0.329 | 0.0219 | 74 | 0.2 | 72 | 0.14 | 1.42 |
16 | 76 | MA1653.1 (ZNF148) | MEME-3 | CCCDSCCCCKCCYCC | CCCDSCCCCKCCYCC | ZNF148 | 5.87 | 0.367 | 142 | 0.39 | 188 | 0.37 | 1.05 |
EPI_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("EPI_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
EPI_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
dplyr::filter(EVALUE.y<0.05) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
# slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*15), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./15,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("EPI_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DNR_24_20kb_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_24_20kb_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
DNR_24_20kb_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_24_20kb_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
DNR_24_20kb_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_24_20kb_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
DNR_24_20kb_sea_all <- rbind(DNR_24_20kb_sea_disc, DNR_24_20kb_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
DNR_24_20kb_combo <- DNR_24_20kb_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., DNR_24_20kb_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
DNR_24_20kb_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in DNR 24h 20kb") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA1134.1 (FOS::JUNB) | STREME-1 | 1-RATGASTCATY | RATGASTCATY | FOS::JUNB | 1.07e-255 | 6.3e-256 | 4120 | 9.28 | 1530 | 3.63 | 2.56 |
2 | 1 | MA1134.1 | FOS::JUNB | MA1134.1 | KATGASTCATHN | FOS::JUNB | 7.89e-246 | 2.32e-246 | 5300 | 11.9 | 2340 | 5.53 | 2.16 |
3 | 1 | MA1135.1 | FOSB::JUNB | MA1135.1 | KRTGASTCAT | FOSB::JUNB | 7.37e-244 | 1.44e-244 | 5710 | 12.8 | 2630 | 6.21 | 2.07 |
4 | 1 | MA0099.3 | FOS::JUN | MA0099.3 | ATGAGTCAYM | FOS::JUN | 5.01e-243 | 7.36e-244 | 4760 | 10.7 | 2000 | 4.72 | 2.27 |
5 | 1 | MA1144.1 | FOSL2::JUND | MA1144.1 | KATGACTCAT | FOSL2::JUND | 5.84e-242 | 6.87e-243 | 5800 | 13.1 | 2700 | 6.39 | 2.04 |
6 | 1 | MA1138.1 | FOSL2::JUNB | MA1138.1 | KRTGASTCAT | FOSL2::JUNB | 2.96e-239 | 2.9e-240 | 5660 | 12.7 | 2620 | 6.19 | 2.06 |
7 | 1 | MA0655.1 (JDP2) | MEME-2 | ATGASTCA | ATGASTCA | JDP2 | 7.98e-237 | 6.71e-238 | 5960 | 13.4 | 2840 | 6.7 | 2 |
8 | 1 | MA1130.1 | FOSL2::JUN | MA1130.1 | NNRTGAGTCAYN | FOSL2::JUN | 3.8e-229 | 2.8e-230 | 4600 | 10.3 | 1940 | 4.59 | 2.25 |
9 | 1 | MA1633.2 | BACH1 | MA1633.2 | ATGACTCAT | BACH1 | 3.38e-226 | 2.21e-227 | 5920 | 13.3 | 2870 | 6.78 | 1.97 |
10 | 1 | MA1141.1 | FOS::JUND | MA1141.1 | NKATGAGTCATNN | FOS::JUND | 5.53e-226 | 3.25e-227 | 4650 | 10.5 | 1990 | 4.71 | 2.22 |
11 | 1 | MA0491.2 | JUND | MA0491.2 | NNATGACTCATNN | JUND | 2.3e-224 | 1.23e-225 | 3810 | 8.58 | 1460 | 3.45 | 2.48 |
12 | 1 | MA0489.2 | Jun | MA0489.2 | NVTGACTCATNN | Jun | 3.64e-220 | 1.79e-221 | 6120 | 13.8 | 3040 | 7.18 | 1.92 |
13 | 1 | MA0490.2 | JUNB | MA0490.2 | NNATGACTCATNN | JUNB | 1.06e-217 | 4.79e-219 | 3700 | 8.33 | 1420 | 3.35 | 2.49 |
14 | 1 | MA0476.1 | FOS | MA0476.1 | DVTGASTCATB | FOS | 9.53999999999999e-213 | 4.01e-214 | 6530 | 14.7 | 3380 | 7.98 | 1.84 |
15 | 1 | MA1988.1 | Atf3 | MA1988.1 | NRTGACTCABN | Atf3 | 1.6e-211 | 6.27e-213 | 4130 | 9.29 | 1720 | 4.05 | 2.29 |
16 | 1 | MA1137.1 | FOSL1::JUNB | MA1137.1 | NDRTGACTCATNN | FOSL1::JUNB | 4.27e-211 | 1.57e-212 | 4420 | 9.96 | 1910 | 4.52 | 2.2 |
17 | 1 | MA0462.2 | BATF::JUN | MA0462.2 | DATGACTCATH | BATF::JUN | 1.43e-209 | 4.97e-211 | 4220 | 9.5 | 1780 | 4.21 | 2.25 |
18 | 1 | MA1634.1 | BATF | MA1634.1 | DATGACTCATH | BATF | 1.19e-206 | 3.88e-208 | 3590 | 8.08 | 1390 | 3.29 | 2.46 |
19 | 1 | MA0478.1 | FOSL2 | MA0478.1 | KRRTGASTCAB | FOSL2 | 4.85e-205 | 1.5e-206 | 5900 | 13.3 | 2960 | 7.01 | 1.89 |
20 | 1 | MA1128.1 | FOSL1::JUN | MA1128.1 | NKATGACTCATNN | FOSL1::JUN | 7.93e-204 | 2.33e-205 | 3290 | 7.41 | 1220 | 2.88 | 2.58 |
21 | 1 | MA0835.2 | BATF3 | MA0835.2 | DATGACTCATH | BATF3 | 7.52e-202 | 2.11e-203 | 4050 | 9.11 | 1700 | 4.03 | 2.26 |
22 | 1 | MA0477.2 | FOSL1 | MA0477.2 | NNATGACTCATNN | FOSL1 | 3.97e-198 | 1.06e-199 | 3040 | 6.83 | 1080 | 2.55 | 2.67 |
23 | 1 | MA0655.1 | JDP2 | MA0655.1 | ATGACTCAT | JDP2 | 5.7e-197 | 1.46e-198 | 5380 | 12.1 | 2640 | 6.24 | 1.94 |
24 | 1 | MA1142.1 | FOSL1::JUND | MA1142.1 | NRTGACTCAT | FOSL1::JUND | 7.18e-186 | 1.76e-187 | 3610 | 8.13 | 1490 | 3.52 | 2.31 |
25 | 1 | MA1101.2 | BACH2 | MA1101.2 | DWANCATGASTCATSNTWH | BACH2 | 1.05e-178 | 2.47e-180 | 5340 | 12 | 2710 | 6.4 | 1.88 |
26 | 1 | MA1928.1 | BNC2 | MA1928.1 | NDRTGAGTCABN | BNC2 | 3.7e-169 | 8.37e-171 | 6080 | 13.7 | 3310 | 7.82 | 1.75 |
27 | 1 | MA0841.1 | NFE2 | MA0841.1 | VATGACTCATS | NFE2 | 8.74e-169 | 1.9e-170 | 7040 | 15.8 | 4040 | 9.55 | 1.66 |
28 | 1 | MA1132.1 | JUN::JUNB | MA1132.1 | KATGACKCAT | JUN::JUNB | 1.41e-128 | 2.97e-130 | 1910 | 4.3 | 659 | 1.56 | 2.76 |
29 | 1 | MA0496.3 | MAFK | MA0496.3 | NGMTGACTCAGCMNH | MAFK | 2.98e-80 | 6.05e-82 | 10100 | 22.8 | 7430 | 17.6 | 1.3 |
30 | 1 | MA0501.1 | MAF::NFE2 | MA0501.1 | ATGACTCAGCANWTY | MAF::NFE2 | 2.4e-71 | 4.7e-73 | 9320 | 21 | 6840 | 16.2 | 1.3 |
31 | 1 | MA0089.2 | MAFG::NFE2L1 | MA0089.2 | NVNATGACTCAGCADW | MAFG::NFE2L1 | 1.57e-64 | 2.99e-66 | 3060 | 6.88 | 1780 | 4.2 | 1.64 |
32 | 1 | MA0659.3 | Mafg | MA0659.3 | NWGMTGACTCAGCAN | Mafg | 3.63e-44 | 6.67e-46 | 3610 | 8.13 | 2390 | 5.66 | 1.44 |
33 | 2 | 6-ACATACACACAM | STREME-6 | 6-ACATACACACAM | ACATACACACAM | ACATACACACAM | 1.55e-41 | 2.76e-43 | 1020 | 2.29 | 458 | 1.08 | 2.12 |
34 | 1 | MA0150.2 | Nfe2l2 | MA0150.2 | CASNATGACTCAGCA | Nfe2l2 | 7.34e-41 | 1.27e-42 | 14700 | 33 | 12100 | 28.6 | 1.15 |
35 | 2 | 4-ACACACATACA | STREME-4 | 4-ACACACATACA | ACACACATACA | ACACACATACA | 2.47e-34 | 4.15e-36 | 1070 | 2.41 | 533 | 1.26 | 1.91 |
36 | 2 | TGTGTGTGTGT | MEME-4 | TGTGTGTGTGT | TGTGTGTGTGT | TGTGTGTGTGT | 1.55e-31 | 2.53e-33 | 1900 | 4.28 | 1170 | 2.76 | 1.55 |
37 | 2 | MA1107.2 (KLF9) | STREME-2 | 2-CACACACACAC | CACACACACAC | KLF9 | 5.05e-29 | 8.03e-31 | 1670 | 3.76 | 1010 | 2.39 | 1.57 |
38 | 3 | MA1990.1 (Gli1) | STREME-8 | 8-CACACACCACACAYA | CACACACCACACAYA | Gli1 | 1.71e-28 | 2.65e-30 | 169 | 0.38 | 17 | 0.04 | 8.99 |
39 | 2 | MA1155.1 (ZSCAN4) | STREME-5 | 5-TGCACACACACA | TGCACACACACA | ZSCAN4 | 1.25e-25 | 1.89e-27 | 1360 | 3.06 | 803 | 1.9 | 1.61 |
40 | 4 | 3-ATGTGTG | STREME-3 | 3-ATGTGTG | ATGTGTG | ATGTGTG | 5.08e-24 | 7.48e-26 | 29600 | 66.6 | 26700 | 63.1 | 1.05 |
41 | 5 | MA0707.2 | MNX1 | MA0707.2 | VYRATTAK | MNX1 | 1.62e-22 | 2.33e-24 | 24100 | 54.2 | 21400 | 50.7 | 1.07 |
42 | 5 | MA0904.2 | HOXB5 | MA0904.2 | STAATTAS | HOXB5 | 4.47e-22 | 6.27e-24 | 24400 | 55 | 21800 | 51.5 | 1.07 |
43 | 5 | MA1495.1 | HOXA1 | MA1495.1 | STAATTAS | HOXA1 | 1.24e-20 | 1.7e-22 | 24100 | 54.2 | 21500 | 50.8 | 1.07 |
44 | 5 | MA1519.1 | LHX5 | MA1519.1 | STAATTAV | LHX5 | 3.89e-20 | 5.2e-22 | 18400 | 41.4 | 16100 | 38.1 | 1.09 |
45 | 5 | MA0158.2 | HOXA5 | MA0158.2 | GYMATTAS | HOXA5 | 6e-20 | 7.85e-22 | 20300 | 45.7 | 17900 | 42.4 | 1.08 |
46 | 6 | MA0626.1 | Npas2 | MA0626.1 | NSCACGTGTN | Npas2 | 2.66e-18 | 3.4e-20 | 18500 | 41.6 | 16300 | 38.5 | 1.08 |
47 | 1 | MA0591.1 | Bach1::Mafk | MA0591.1 | RSSATGACTCAGCAB | Bach1::Mafk | 6.06e-18 | 7.59e-20 | 6670 | 15 | 5420 | 12.8 | 1.17 |
48 | 5 | MA0886.1 | EMX2 | MA0886.1 | VBTAATTARB | EMX2 | 1.47e-17 | 1.81e-19 | 28500 | 64.2 | 25900 | 61.2 | 1.05 |
49 | 5 | MA0705.1 | Lhx8 | MA0705.1 | CTAATTAV | Lhx8 | 6.22e-16 | 7.47e-18 | 24700 | 55.5 | 22200 | 52.6 | 1.06 |
50 | 5 | MA0661.1 | MEOX1 | MA0661.1 | VSTAATTAHC | MEOX1 | 4.86e-15 | 5.72e-17 | 19000 | 42.7 | 16800 | 39.8 | 1.07 |
51 | 7 | MA1155.1 | ZSCAN4 | MA1155.1 | TGCACACACTGAAAA | ZSCAN4 | 2.91e-14 | 3.36e-16 | 1040 | 2.33 | 656 | 1.55 | 1.5 |
52 | 5 | MA0912.2 | HOXD3 | MA0912.2 | CTAATTAC | HOXD3 | 7.43e-14 | 8.41e-16 | 15000 | 33.9 | 13200 | 31.2 | 1.08 |
53 | 8 | MA0041.2 | FOXD3 | MA0041.2 | WAWGHAAATAAACARY | FOXD3 | 1.09e-13 | 1.21e-15 | 33100 | 74.6 | 30500 | 72.1 | 1.03 |
54 | 8 | MA1606.1 | Foxf1 | MA1606.1 | DWRTAAACANN | Foxf1 | 2.55e-13 | 2.75e-15 | 39500 | 88.8 | 36800 | 87.1 | 1.02 |
55 | 5 | MA0902.2 | HOXB2 | MA0902.2 | SYAATTAS | HOXB2 | 2.57e-13 | 2.75e-15 | 16000 | 36.1 | 14200 | 33.5 | 1.08 |
56 | 9 | MA1115.1 | POU5F1 | MA1115.1 | WHATGCAAATN | POU5F1 | 6.76e-13 | 7.1e-15 | 34200 | 77 | 31600 | 74.7 | 1.03 |
57 | 5 | MA0612.2 | EMX1 | MA0612.2 | BTAATTAS | EMX1 | 1.74e-12 | 1.79e-14 | 14500 | 32.7 | 12800 | 30.2 | 1.08 |
58 | 8 | MA0846.1 | FOXC2 | MA0846.1 | WAWGTAAACAWW | FOXC2 | 2.02e-12 | 2.05e-14 | 36600 | 82.4 | 34000 | 80.4 | 1.03 |
59 | 5 | MA0910.2 | HOXD8 | MA0910.2 | GYMATTAM | HOXD8 | 2.27e-12 | 2.27e-14 | 23000 | 51.9 | 20800 | 49.2 | 1.05 |
60 | 10 | MA1143.1 | FOSL1::JUND | MA1143.1 | RTGACGTMAY | FOSL1::JUND | 2.4e-12 | 2.35e-14 | 16000 | 36 | 14200 | 33.5 | 1.08 |
61 | 1 | MA1639.1 | MEIS1 | MA1639.1 | NHCATMAATCAHN | MEIS1 | 3.26e-12 | 3.14e-14 | 33900 | 76.3 | 31300 | 74 | 1.03 |
62 | 9 | MA0792.1 | POU5F1B | MA0792.1 | TATGCWAAT | POU5F1B | 3.4e-12 | 3.23e-14 | 35600 | 80.1 | 33000 | 77.9 | 1.03 |
63 | 5 | MA0706.1 | MEOX2 | MA0706.1 | DSTAATTAWN | MEOX2 | 5.85e-12 | 5.46e-14 | 25400 | 57.2 | 23100 | 54.6 | 1.05 |
64 | 5 | MA0027.2 | EN1 | MA0027.2 | SYAATTAV | EN1 | 6.26e-12 | 5.75e-14 | 32400 | 72.8 | 29800 | 70.5 | 1.03 |
65 | 11 | MA0502.2 (NFYB) | STREME-9 | 9-CRGCCAATGG | CRGCCAATGG | NFYB | 8.75e-12 | 7.91e-14 | 286 | 0.64 | 123 | 0.29 | 2.2 |
66 | 13 | MA0766.2 | GATA5 | MA0766.2 | SWGATAASRN | GATA5 | 8.88e-12 | 7.91e-14 | 7680 | 17.3 | 6500 | 15.4 | 1.13 |
67 | 5 | MA1502.1 | HOXB8 | MA1502.1 | GYMATTAM | HOXB8 | 9.3e-12 | 8.16e-14 | 22800 | 51.4 | 20700 | 48.8 | 1.05 |
68 | 8 | MA0845.1 | FOXB1 | MA0845.1 | WATGTAAATAT | FOXB1 | 1.31e-11 | 1.14e-13 | 34200 | 77 | 31600 | 74.8 | 1.03 |
69 | 6 | MA0093.3 | USF1 | MA0093.3 | NDGTCATGTGACHN | USF1 | 1.33e-11 | 1.14e-13 | 35700 | 80.4 | 33100 | 78.3 | 1.03 |
70 | 12 | MA0676.1 | Nr2e1 | MA0676.1 | AAAAGTCAA | Nr2e1 | 1.66e-11 | 1.39e-13 | 12600 | 28.3 | 11000 | 26 | 1.09 |
71 | 13 | MA0036.3 (GATA2) | STREME-10 | 10-WGATAAG | 10-WGATAAG | GATA2 | 2.08e-11 | 1.73e-13 | 28900 | 65 | 26500 | 62.5 | 1.04 |
72 | 11 | MA0502.2 | NFYB | MA0502.2 | CYCATTGGCCVV | NFYB | 2.59e-11 | 2.12e-13 | 2650 | 5.97 | 2040 | 4.81 | 1.24 |
73 | 14 | MA0106.3 | TP53 | MA0106.3 | RACATGYCCGGRCATGTY | TP53 | 2.7e-11 | 2.18e-13 | 5100 | 11.5 | 4190 | 9.9 | 1.16 |
74 | 5 | MA1530.1 | NKX6-3 | MA1530.1 | VYCATTAWW | NKX6-3 | 5.68e-11 | 4.52e-13 | 23700 | 53.2 | 21500 | 50.7 | 1.05 |
75 | 9 | MA0142.1 | Pou5f1::Sox2 | MA0142.1 | CWTTGTYATGCAAAT | Pou5f1::Sox2 | 6.16e-11 | 4.83e-13 | 31500 | 70.9 | 29000 | 68.6 | 1.03 |
76 | 15 | MA0148.4 | FOXA1 | MA0148.4 | WWGTAAACATNN | FOXA1 | 6.52e-11 | 5.05e-13 | 36800 | 82.8 | 34200 | 80.8 | 1.02 |
77 | 1 | MA1640.1 | MEIS2 | MA1640.1 | NNATGATTTATGDNH | MEIS2 | 7.83e-11 | 5.98e-13 | 35500 | 79.8 | 32900 | 77.8 | 1.03 |
78 | 9 | MA0786.1 | POU3F1 | MA0786.1 | WTATGCWAATKW | POU3F1 | 1.16e-10 | 8.72e-13 | 34200 | 77 | 31700 | 74.8 | 1.03 |
79 | 1 | MA1113.2 | PBX2 | MA1113.2 | NHCATAAATCAHN | PBX2 | 1.64e-10 | 1.22e-12 | 30300 | 68.2 | 27900 | 65.9 | 1.04 |
80 | 9 | MA0785.1 | POU2F1 | MA0785.1 | AWTATGCWAATK | POU2F1 | 2.42e-10 | 1.78e-12 | 32100 | 72.3 | 29700 | 70.1 | 1.03 |
81 | 15 | MA0848.1 | FOXO4 | MA0848.1 | GTAAACA | FOXO4 | 2.62e-10 | 1.9e-12 | 36200 | 81.5 | 33600 | 79.5 | 1.02 |
82 | 9 | MA0789.1 | POU3F4 | MA0789.1 | TATGCWAAT | POU3F4 | 2.89e-10 | 2.07e-12 | 29200 | 65.7 | 26800 | 63.4 | 1.04 |
83 | 8 | MA1683.1 | FOXA3 | MA1683.1 | DWGTAAACANN | FOXA3 | 3.57e-10 | 2.53e-12 | 34400 | 77.3 | 31800 | 75.3 | 1.03 |
84 | 5 | MA0879.2 | DLX1 | MA0879.2 | VYAATTAN | DLX1 | 3.78e-10 | 2.65e-12 | 34000 | 76.4 | 31400 | 74.3 | 1.03 |
85 | 5 | MA1505.1 | HOXC8 | MA1505.1 | KTAATTAM | HOXC8 | 6.16e-10 | 4.26e-12 | 33700 | 75.9 | 31200 | 73.8 | 1.03 |
86 | 15 | MA0033.2 | FOXL1 | MA0033.2 | RTAAACA | FOXL1 | 9.73e-10 | 6.66e-12 | 35800 | 80.4 | 33200 | 78.5 | 1.02 |
87 | 9 | MA0787.1 | POU3F2 | MA0787.1 | WTATGCWAATKA | POU3F2 | 9.93e-10 | 6.72e-12 | 29500 | 66.5 | 27200 | 64.2 | 1.04 |
88 | 5 | MA0900.2 | HOXA2 | MA0900.2 | STAATTAS | HOXA2 | 1.14e-09 | 7.6e-12 | 16400 | 36.8 | 14600 | 34.6 | 1.07 |
89 | 5 | MA0675.1 | NKX6-2 | MA0675.1 | NYMATTAA | NKX6-2 | 1.5e-09 | 9.93e-12 | 32300 | 72.7 | 29900 | 70.6 | 1.03 |
90 | 6 | MA1108.2 | MXI1 | MA1108.2 | NNCACATGBN | MXI1 | 1.55e-09 | 1.01e-11 | 19300 | 43.4 | 17400 | 41.1 | 1.06 |
91 | 8 | MA0481.3 | FOXP1 | MA0481.3 | NDGTAAACANH | FOXP1 | 1.63e-09 | 1.06e-11 | 38100 | 85.8 | 35600 | 84.1 | 1.02 |
92 | 8 | MA0047.3 | FOXA2 | MA0047.3 | RWGTAAACANN | FOXA2 | 2.26e-09 | 1.45e-11 | 39400 | 88.6 | 36900 | 87.1 | 1.02 |
93 | 23 | MA0498.2 | MEIS1 | MA0498.2 | HTGACAD | MEIS1 | 4.81e-09 | 3.04e-11 | 35500 | 79.8 | 33000 | 78 | 1.02 |
94 | 5 | MA0701.2 | LHX9 | MA0701.2 | CYAATTAR | LHX9 | 5.96e-09 | 3.73e-11 | 13200 | 29.8 | 11700 | 27.7 | 1.08 |
95 | 8 | MA1103.2 | FOXK2 | MA1103.2 | DDGTAAACANN | FOXK2 | 6.94e-09 | 4.3e-11 | 41000 | 92.3 | 38500 | 91 | 1.01 |
96 | 5 | MA0644.2 | ESX1 | MA0644.2 | YYAATTAN | ESX1 | 7.47e-09 | 4.58e-11 | 26900 | 60.6 | 24700 | 58.3 | 1.04 |
97 | 9 | MA0627.2 | POU2F3 | MA0627.2 | NWTATGCAAATNH | POU2F3 | 1.62e-08 | 9.8e-11 | 31900 | 71.7 | 29500 | 69.7 | 1.03 |
98 | 16 | MA1479.1 | DMRTC2 | MA1479.1 | AATTGMTACATT | DMRTC2 | 2e-08 | 1.2e-10 | 34600 | 77.9 | 32200 | 76 | 1.02 |
99 | 5 | MA0708.2 | MSX2 | MA0708.2 | SCAATTAV | MSX2 | 2.37e-08 | 1.41e-10 | 30900 | 69.6 | 28600 | 67.6 | 1.03 |
100 | 9 | MA0788.1 | POU3F3 | MA0788.1 | WWTATGCWAATTW | POU3F3 | 2.56e-08 | 1.5e-10 | 31300 | 70.4 | 28900 | 68.4 | 1.03 |
101 | 8 | MA1978.1 | ZNF354A | MA1978.1 | HWAADWATAAATGRAYWAWTT | ZNF354A | 2.63e-08 | 1.53e-10 | 32700 | 73.5 | 30300 | 71.5 | 1.03 |
102 | 5 | MA1549.1 | POU6F1 | MA1549.1 | RTAATGAGVN | POU6F1 | 4.01e-08 | 2.31e-10 | 25800 | 58.2 | 23700 | 56 | 1.04 |
103 | 5 | MA0666.2 | MSX1 | MA0666.2 | SCAATTAR | MSX1 | 4.56e-08 | 2.61e-10 | 28700 | 64.5 | 26400 | 62.4 | 1.03 |
104 | 17 | MA1120.1 | SOX13 | MA1120.1 | DVACAATGGNN | SOX13 | 4.92e-08 | 2.78e-10 | 38400 | 86.3 | 35900 | 84.8 | 1.02 |
105 | 5 | MA0882.1 | DLX6 | MA0882.1 | VYAATTAN | DLX6 | 5.95e-08 | 3.34e-10 | 38400 | 86.3 | 35900 | 84.8 | 1.02 |
106 | 18 | MA0791.1 | POU4F3 | MA0791.1 | RTGMATWATTAATGAR | POU4F3 | 6.01e-08 | 3.34e-10 | 26400 | 59.4 | 24200 | 57.3 | 1.04 |
107 | 5 | MA0718.1 | RAX | MA0718.1 | RYYAATTARH | RAX | 6.68e-08 | 3.67e-10 | 36200 | 81.4 | 33700 | 79.7 | 1.02 |
108 | 23 | MA0117.2 | Mafb | MA0117.2 | AWWNTGCTGACD | Mafb | 9.29e-08 | 5.06e-10 | 34500 | 77.5 | 32000 | 75.7 | 1.02 |
109 | 23 | MA0842.2 | NRL | MA0842.2 | AAWWNTGCTGACG | NRL | 9.86e-08 | 5.32e-10 | 6930 | 15.6 | 5950 | 14 | 1.11 |
110 | 10 | MA0018.4 | CREB1 | MA0018.4 | NNRTGATGTCAYN | CREB1 | 1.04e-07 | 5.59e-10 | 19200 | 43.3 | 17400 | 41.2 | 1.05 |
111 | 5 | MA0887.1 | EVX1 | MA0887.1 | VVTAATTABS | EVX1 | 1.48e-07 | 7.85e-10 | 30000 | 67.6 | 27700 | 65.6 | 1.03 |
112 | 5 | MA0709.1 | Msx3 | MA0709.1 | SCAATTAN | Msx3 | 1.82e-07 | 9.58e-10 | 23600 | 53 | 21500 | 50.9 | 1.04 |
113 | 5 | MA0722.1 | VAX1 | MA0722.1 | YTAATTAH | VAX1 | 2.48e-07 | 1.29e-09 | 25100 | 56.6 | 23100 | 54.5 | 1.04 |
114 | 5 | MA0654.1 | ISX | MA0654.1 | YYAATTAR | ISX | 3.63e-07 | 1.87e-09 | 23900 | 53.9 | 21900 | 51.8 | 1.04 |
115 | 5 | MA0880.1 | Dlx3 | MA0880.1 | NYAATTAN | Dlx3 | 4.06e-07 | 2.08e-09 | 24900 | 56.1 | 22900 | 54 | 1.04 |
116 | 5 | MA0710.1 | NOTO | MA0710.1 | NYTAATTAVN | NOTO | 6.04e-07 | 3.06e-09 | 24900 | 56.1 | 22900 | 54.1 | 1.04 |
117 | 19 | MA0611.2 | Dux | MA0611.2 | NTGATTBAATCAVWNN | Dux | 7.78e-07 | 3.91e-09 | 13600 | 30.6 | 12200 | 28.7 | 1.06 |
118 | 5 | MA1476.2 | Dlx5 | MA1476.2 | NDGCAATTAGNN | Dlx5 | 8.72e-07 | 4.35e-09 | 37600 | 84.6 | 35200 | 83.1 | 1.02 |
119 | 20 | MA0797.1 | TGIF2 | MA0797.1 | TGACAGSTGTCA | TGIF2 | 8.86e-07 | 4.38e-09 | 13600 | 30.7 | 12200 | 28.8 | 1.06 |
120 | 21 | MA0090.3 | TEAD1 | MA0090.3 | NNACATTCCAGSN | TEAD1 | 1.05e-06 | 5.15e-09 | 33000 | 74.2 | 30600 | 72.4 | 1.02 |
121 | 5 | MA0704.1 | Lhx4 | MA0704.1 | YTAATTAR | Lhx4 | 1.21e-06 | 5.87e-09 | 37200 | 83.6 | 34700 | 82.1 | 1.02 |
122 | 5 | MA0700.2 | LHX2 | MA0700.2 | DDCCAATTADN | LHX2 | 1.38e-06 | 6.62e-09 | 33900 | 76.2 | 31500 | 74.5 | 1.02 |
123 | 15 | MA0042.2 | FOXI1 | MA0042.2 | GTAAACA | FOXI1 | 1.38e-06 | 6.62e-09 | 35200 | 79.2 | 32800 | 77.6 | 1.02 |
124 | 22 | MA1124.1 | ZNF24 | MA1124.1 | CATTCATTCATTC | ZNF24 | 2.21e-06 | 1.05e-08 | 22700 | 51.2 | 20800 | 49.2 | 1.04 |
125 | 5 | MA0885.2 | Dlx2 | MA0885.2 | NDGCAATTAGNN | Dlx2 | 3.34e-06 | 1.57e-08 | 36800 | 82.9 | 34400 | 81.4 | 1.02 |
126 | 5 | MA0658.1 | LHX6 | MA0658.1 | RCTAATTARY | LHX6 | 3.35e-06 | 1.57e-08 | 18700 | 42.2 | 17000 | 40.2 | 1.05 |
127 | 5 | MA0889.1 | GBX1 | MA0889.1 | VCYAATTASY | GBX1 | 4.57e-06 | 2.12e-08 | 18400 | 41.3 | 16700 | 39.4 | 1.05 |
128 | 23 | MA0495.3 (MAFF) | STREME-7 | 7-CTGTCAGCACT | CTGTCAGCACT | MAFF | 4.65e-06 | 2.14e-08 | 1140 | 2.57 | 843 | 1.99 | 1.29 |
129 | 24 | MA1114.1 | PBX3 | MA1114.1 | NNNTGAGTGACAGGBNN | PBX3 | 4.69e-06 | 2.14e-08 | 14500 | 32.6 | 13000 | 30.8 | 1.06 |
130 | 5 | MA0793.1 | POU6F2 | MA0793.1 | ASCTMATTAW | POU6F2 | 4.88e-06 | 2.21e-08 | 23800 | 53.6 | 21900 | 51.7 | 1.04 |
131 | 5 | MA1960.1 | MGA::EVX1 | MA1960.1 | RGGTGWTAATKD | MGA::EVX1 | 4.95e-06 | 2.22e-08 | 30100 | 67.6 | 27800 | 65.8 | 1.03 |
132 | 8 | MA0032.2 | FOXC1 | MA0032.2 | WAWGTAAAYAW | FOXC1 | 6.47e-06 | 2.88e-08 | 27300 | 61.5 | 25200 | 59.6 | 1.03 |
133 | 18 | MA0790.1 | POU4F1 | MA0790.1 | ATGMATAATTAATG | POU4F1 | 6.66e-06 | 2.95e-08 | 27800 | 62.5 | 25700 | 60.6 | 1.03 |
134 | 8 | MA0480.2 | Foxo1 | MA0480.2 | NDGTAAACANN | Foxo1 | 7e-06 | 3.07e-08 | 40300 | 90.7 | 37900 | 89.5 | 1.01 |
135 | 25 | MA0755.1 | CUX2 | MA0755.1 | TRATCGATAW | CUX2 | 7.1e-06 | 3.09e-08 | 24000 | 53.9 | 22000 | 52 | 1.04 |
136 | 5 | MA0890.1 | GBX2 | MA0890.1 | RSYAATTARN | GBX2 | 7.27e-06 | 3.15e-08 | 23800 | 53.5 | 21800 | 51.6 | 1.04 |
137 | 14 | MA0861.1 | TP73 | MA0861.1 | DRCATGTCNNRACAYGYM | TP73 | 7.8e-06 | 3.35e-08 | 7720 | 17.4 | 6740 | 15.9 | 1.09 |
138 | 20 | MA0774.1 | MEIS2 | MA0774.1 | TTGACAGS | MEIS2 | 7.9e-06 | 3.37e-08 | 4620 | 10.4 | 3920 | 9.27 | 1.12 |
139 | 6 | MA1618.1 | Ptf1a | MA1618.1 | NNACAGATGTTNN | Ptf1a | 9.02e-06 | 3.82e-08 | 36900 | 83.1 | 34600 | 81.7 | 1.02 |
140 | 6 | MA1570.1 | TFAP4 | MA1570.1 | AHCATRTGDT | TFAP4 | 9.34e-06 | 3.92e-08 | 15400 | 34.6 | 13900 | 32.8 | 1.05 |
141 | 5 | MA0488.1 | JUN | MA0488.1 | DDRATGATGTCAT | JUN | 1.01e-05 | 4.21e-08 | 7520 | 16.9 | 6570 | 15.5 | 1.09 |
142 | 8 | MA1607.1 | Foxl2 | MA1607.1 | DDWATGTAAACANN | Foxl2 | 1.06e-05 | 4.4e-08 | 37700 | 84.8 | 35300 | 83.4 | 1.02 |
143 | 38 | MA1107.2 | KLF9 | MA1107.2 | MNGCCACACCCACHYV | KLF9 | 1.1e-05 | 4.52e-08 | 1990 | 4.48 | 1580 | 3.73 | 1.2 |
144 | 26 | MA0805.1 | TBX1 | MA0805.1 | AGGTGTGA | TBX1 | 1.15e-05 | 4.7e-08 | 2260 | 5.09 | 1810 | 4.29 | 1.19 |
145 | 5 | MA0723.2 | VAX2 | MA0723.2 | BYAATTAV | VAX2 | 1.25e-05 | 5.08e-08 | 38500 | 86.7 | 36100 | 85.4 | 1.02 |
146 | 6 | MA0871.2 | TFEC | MA0871.2 | NVCACGTGACN | TFEC | 1.61e-05 | 6.5e-08 | 10600 | 23.9 | 9430 | 22.3 | 1.07 |
147 | 5 | MA0075.3 | PRRX2 | MA0075.3 | CTAATTAN | PRRX2 | 2.05e-05 | 8.2e-08 | 17600 | 39.7 | 16000 | 37.9 | 1.05 |
148 | 24 | MA0063.2 | NKX2-5 | MA0063.2 | NNCACTCAANN | NKX2-5 | 2.18e-05 | 8.68e-08 | 41000 | 92.2 | 38600 | 91.2 | 1.01 |
149 | 6 | MA0461.2 | Atoh1 | MA0461.2 | RMCATATGBY | Atoh1 | 4.03e-05 | 1.59e-07 | 30900 | 69.5 | 28700 | 67.8 | 1.02 |
150 | 17 | MA0514.2 | Sox3 | MA0514.2 | DNACAATGGNN | Sox3 | 4.17e-05 | 1.64e-07 | 40100 | 90.2 | 37700 | 89 | 1.01 |
151 | 5 | MA0699.1 | LBX2 | MA0699.1 | NCYAATTARN | LBX2 | 4.48e-05 | 1.74e-07 | 21300 | 48 | 19500 | 46.2 | 1.04 |
152 | 14 | MA0525.2 | TP63 | MA0525.2 | RACATGTYGKGACATGTC | TP63 | 4.56e-05 | 1.76e-07 | 404 | 0.91 | 252 | 0.6 | 1.52 |
153 | 27 | MA0060.3 | NFYA | MA0060.3 | RRCCAATCAGM | NFYA | 4.8e-05 | 1.85e-07 | 39400 | 88.8 | 37100 | 87.6 | 1.01 |
154 | 5 | MA1480.1 | DPRX | MA1480.1 | AGATAATCCN | DPRX | 5.72e-05 | 2.18e-07 | 35500 | 79.8 | 33200 | 78.4 | 1.02 |
155 | 8 | MA0851.1 | Foxj3 | MA0851.1 | NDAADGTAAACAAANNM | Foxj3 | 6.97e-05 | 2.65e-07 | 24900 | 56 | 22900 | 54.2 | 1.03 |
156 | 5 | MA0881.1 | Dlx4 | MA0881.1 | VYAATTAN | Dlx4 | 8.13e-05 | 3.07e-07 | 38400 | 86.4 | 36000 | 85.1 | 1.01 |
157 | 5 | MA1499.1 | HOXB4 | MA1499.1 | RTMATTAR | HOXB4 | 8.71e-05 | 3.26e-07 | 17100 | 38.5 | 15600 | 36.8 | 1.05 |
158 | 24 | MA0782.2 | PKNOX1 | MA0782.2 | NNTGAGTGACAGVNN | PKNOX1 | 9.04e-05 | 3.36e-07 | 9480 | 21.3 | 8420 | 19.9 | 1.07 |
159 | 5 | MA1481.1 | DRGX | MA1481.1 | YYAATTAN | DRGX | 0.000114 | 4.22e-07 | 34600 | 78 | 32400 | 76.5 | 1.02 |
160 | 5 | MA0720.1 | Shox2 | MA0720.1 | YYAATTAR | Shox2 | 0.000129 | 4.73e-07 | 16600 | 37.3 | 15100 | 35.6 | 1.05 |
161 | 5 | MA1496.1 | HOXA4 | MA1496.1 | RTMATTAV | HOXA4 | 0.000157 | 5.74e-07 | 27400 | 61.7 | 25400 | 60 | 1.03 |
162 | 13 | MA0036.3 | GATA2 | MA0036.3 | NBCTTATCTNH | GATA2 | 0.000198 | 7.19e-07 | 24100 | 54.2 | 22200 | 52.5 | 1.03 |
163 | 5 | MA1500.1 | HOXB6 | MA1500.1 | BTAATKRC | HOXB6 | 0.000202 | 7.3e-07 | 29400 | 66.2 | 27300 | 64.6 | 1.03 |
164 | 28 | MA0596.1 | SREBF2 | MA0596.1 | RTGGGGTGAY | SREBF2 | 0.00033 | 1.18e-06 | 5520 | 12.4 | 4800 | 11.3 | 1.1 |
165 | 15 | MA0613.1 | FOXG1 | MA0613.1 | RTAAACAW | FOXG1 | 0.000345 | 1.23e-06 | 18400 | 41.5 | 16900 | 39.9 | 1.04 |
166 | 17 | MA0143.4 | SOX2 | MA0143.4 | DNACAATGGNN | SOX2 | 0.000355 | 1.26e-06 | 41600 | 93.5 | 39200 | 92.7 | 1.01 |
167 | 17 | MA1563.2 | SOX18 | MA1563.2 | AACAATDV | SOX18 | 0.00038 | 1.34e-06 | 31900 | 71.8 | 29800 | 70.3 | 1.02 |
168 | 5 | MA0680.2 | Pax7 | MA0680.2 | DWTAATCAATTAWNH | Pax7 | 0.000392 | 1.37e-06 | 26500 | 59.6 | 24500 | 57.9 | 1.03 |
169 | 5 | MA0888.1 | EVX2 | MA0888.1 | VVTAATTAVB | EVX2 | 0.000392 | 1.37e-06 | 29800 | 67.1 | 27700 | 65.5 | 1.02 |
170 | 23 | MA0495.3 | MAFF | MA0495.3 | DWGTCAGCATTTTWHN | MAFF | 0.000413 | 1.43e-06 | 37000 | 83.4 | 34700 | 82.1 | 1.02 |
171 | 5 | MA0892.1 | GSX1 | MA0892.1 | NCYMATTAVN | GSX1 | 0.000427 | 1.47e-06 | 43200 | 97.2 | 40900 | 96.6 | 1.01 |
172 | 27 | MA0035.4 | GATA1 | MA0035.4 | NHCTAATCTDH | GATA1 | 0.000514 | 1.76e-06 | 19800 | 44.6 | 18200 | 42.9 | 1.04 |
173 | 5 | MA0716.1 | PRRX1 | MA0716.1 | YYAATTAR | PRRX1 | 0.000542 | 1.84e-06 | 37900 | 85.3 | 35600 | 84.1 | 1.01 |
174 | 17 | MA0077.1 | SOX9 | MA0077.1 | CYATTGTTY | SOX9 | 0.00069 | 2.33e-06 | 41100 | 92.5 | 38800 | 91.6 | 1.01 |
175 | 17 | MA1562.1 | SOX14 | MA1562.1 | CCGAACAATG | SOX14 | 0.000693 | 2.33e-06 | 25200 | 56.8 | 23300 | 55.2 | 1.03 |
176 | 26 | MA0803.1 | TBX15 | MA0803.1 | AGGTGTGA | TBX15 | 0.000781 | 2.61e-06 | 29600 | 66.6 | 27500 | 65.1 | 1.02 |
177 | 5 | MA0628.1 | POU6F1 | MA0628.1 | AYTAATTART | POU6F1 | 0.000872 | 2.9e-06 | 27700 | 62.4 | 25700 | 60.8 | 1.03 |
178 | 20 | MA0691.1 | TFAP4 | MA0691.1 | AWCAGCTGWT | TFAP4 | 0.00109 | 3.61e-06 | 4140 | 9.3 | 3550 | 8.39 | 1.11 |
179 | 5 | MA0630.1 | SHOX | MA0630.1 | HTAATTRR | SHOX | 0.00119 | 3.9e-06 | 40800 | 91.8 | 38500 | 90.9 | 1.01 |
180 | 29 | MA0073.1 | RREB1 | MA0073.1 | CCCCMAAMCAMCCMCMMMCV | RREB1 | 0.00135 | 4.41e-06 | 856 | 1.93 | 640 | 1.51 | 1.27 |
181 | 1 | MA0070.1 | PBX1 | MA0070.1 | HCATCAATCAAW | PBX1 | 0.00162 | 5.27e-06 | 15700 | 35.2 | 14300 | 33.7 | 1.04 |
182 | 8 | MA0897.1 | Hmx2 | MA0897.1 | ASAAGCAATTAAHVVRT | Hmx2 | 0.0019 | 6.14e-06 | 24100 | 54.2 | 22300 | 52.6 | 1.03 |
183 | 30 | MA1573.2 | Thap11 | MA1573.2 | ACTACAABTCCCAG | Thap11 | 0.0024 | 7.72e-06 | 140 | 0.32 | 69 | 0.16 | 1.92 |
184 | 8 | MA0157.3 | Foxo3 | MA0157.3 | NDGTAAACARNN | Foxo3 | 0.00291 | 9.32e-06 | 42200 | 95.1 | 39900 | 94.4 | 1.01 |
185 | 31 | MA0698.1 | ZBTB18 | MA0698.1 | NATCCAGATGTKB | ZBTB18 | 0.00339 | 1.08e-05 | 10500 | 23.6 | 9430 | 22.3 | 1.06 |
186 | 18 | MA0153.2 | HNF1B | MA0153.2 | GTTAATNATTAAY | HNF1B | 0.00369 | 1.17e-05 | 17400 | 39.2 | 15900 | 37.7 | 1.04 |
187 | 15 | MA0031.1 | FOXD1 | MA0031.1 | GTAAACAW | FOXD1 | 0.00389 | 1.22e-05 | 28800 | 64.9 | 26800 | 63.4 | 1.02 |
188 | 17 | MA1152.1 | SOX15 | MA1152.1 | CYWTTGTTHW | SOX15 | 0.00393 | 1.23e-05 | 36400 | 81.8 | 34100 | 80.7 | 1.01 |
189 | 24 | MA1645.1 | NKX2-2 | MA1645.1 | NNNCCACTCAANNN | NKX2-2 | 0.00434 | 1.35e-05 | 7120 | 16 | 6320 | 14.9 | 1.07 |
190 | 31 | MA0818.2 | BHLHE22 | MA0818.2 | AMCATATGKY | BHLHE22 | 0.00443 | 1.37e-05 | 9680 | 21.8 | 8690 | 20.6 | 1.06 |
13 | 34 | MA0682.2 (PITX1) | MEME-3 | GCCTGTAATCCCAGC | GCCTGTAATCCCAGC | PITX1 | 0.00449 | 0.000345 | 1360 | 3.06 | 1130 | 2.66 | 1.15 |
191 | 18 | MA0046.2 | HNF1A | MA0046.2 | DRTTAATNATTAACN | HNF1A | 0.00527 | 1.62e-05 | 18400 | 41.4 | 16900 | 40 | 1.04 |
192 | 5 | MA1608.1 | Isl1 | MA1608.1 | NNCCATTAGNN | Isl1 | 0.00606 | 1.86e-05 | 40700 | 91.6 | 38400 | 90.8 | 1.01 |
193 | 32 | MA1974.1 | ZNF211 | MA1974.1 | HWYATATACCAYRYW | ZNF211 | 0.00653 | 1.99e-05 | 11200 | 25.2 | 10100 | 24 | 1.05 |
194 | 13 | MA1104.2 | GATA6 | MA1104.2 | HWWTCTTATCTNH | GATA6 | 0.00671 | 2.04e-05 | 21900 | 49.2 | 20200 | 47.7 | 1.03 |
195 | 15 | MA0849.1 | FOXO6 | MA0849.1 | GTAAACA | FOXO6 | 0.00813 | 2.45e-05 | 24400 | 54.9 | 22600 | 53.4 | 1.03 |
196 | 28 | MA0595.1 | SREBF1 | MA0595.1 | VTCACCCCAY | SREBF1 | 0.00886 | 2.66e-05 | 2580 | 5.81 | 2180 | 5.15 | 1.13 |
197 | 6 | MA0004.1 | Arnt | MA0004.1 | CACGTG | Arnt | 0.00929 | 2.78e-05 | 752 | 1.69 | 566 | 1.34 | 1.26 |
198 | 33 | MA0052.4 | MEF2A | MA0052.4 | DDCTAAAAATAGMHH | MEF2A | 0.0142 | 4.2e-05 | 33800 | 76 | 31600 | 74.8 | 1.02 |
199 | 26 | MA0689.1 | TBX20 | MA0689.1 | WAGGTGTGAAR | TBX20 | 0.0142 | 4.21e-05 | 6840 | 15.4 | 6080 | 14.4 | 1.07 |
200 | 20 | MA1572.1 | TGIF2LY | MA1572.1 | TGACAGCTGTCA | TGIF2LY | 0.0145 | 4.27e-05 | 36200 | 81.6 | 34000 | 80.5 | 1.01 |
201 | 5 | MA0713.1 | PHOX2A | MA0713.1 | TAATYYAATTA | PHOX2A | 0.0174 | 5.09e-05 | 28200 | 63.4 | 26200 | 62 | 1.02 |
202 | 5 | MA0914.1 | ISL2 | MA0914.1 | GCAMTTAR | ISL2 | 0.0199 | 5.81e-05 | 30700 | 69.1 | 28700 | 67.8 | 1.02 |
203 | 26 | MA1567.2 | Tbx6 | MA1567.2 | NRGGTGTGAANN | Tbx6 | 0.0203 | 5.89e-05 | 5620 | 12.7 | 4970 | 11.8 | 1.08 |
204 | 5 | MA0674.1 | NKX6-1 | MA0674.1 | NTMATTAA | NKX6-1 | 0.0224 | 6.45e-05 | 29000 | 65.3 | 27100 | 64 | 1.02 |
205 | 15 | MA0850.1 | FOXP3 | MA0850.1 | RTAAACA | FOXP3 | 0.0245 | 7.03e-05 | 30100 | 67.8 | 28100 | 66.5 | 1.02 |
206 | 27 | MA1644.1 | NFYC | MA1644.1 | RDCCAATCASN | NFYC | 0.0248 | 7.07e-05 | 39800 | 89.5 | 37500 | 88.6 | 1.01 |
207 | 9 | MA0025.2 | NFIL3 | MA0025.2 | NRTTATGYAATNN | NFIL3 | 0.0252 | 7.16e-05 | 18100 | 40.8 | 16700 | 39.5 | 1.03 |
208 | 26 | MA0800.1 | EOMES | MA0800.1 | RAGGTGTGAAAWN | EOMES | 0.0298 | 8.43e-05 | 37500 | 84.3 | 35200 | 83.3 | 1.01 |
209 | 26 | MA0802.1 | TBR1 | MA0802.1 | AGGTGTGAAA | TBR1 | 0.0323 | 9.1e-05 | 38000 | 85.5 | 35800 | 84.5 | 1.01 |
210 | 5 | MA0642.2 | EN2 | MA0642.2 | VSYAATTA | EN2 | 0.0422 | 0.000118 | 29400 | 66.2 | 27500 | 65 | 1.02 |
14 | 35 | MA1973.1 (ZKSCAN3) | MEME-6 | TGCCCAGGCTG | TGCCCAGGCTG | ZKSCAN3 | 2.1 | 0.15 | 532 | 1.2 | 470 | 1.11 | 1.08 |
15 | 36 | MA0471.2 (E2F6) | MEME-7 | DGAGGARGGAR | DGAGGARGGAR | E2F6 | 3.33 | 0.222 | 14000 | 31.4 | 13200 | 31.2 | 1.01 |
16 | 37 | MA1125.1 (ZNF384) | MEME-1 | AAAAAAAAAAAADWA | AAAAAAAAAAAADWA | ZNF384 | 3.75 | 0.235 | 1610 | 3.62 | 1490 | 3.52 | 1.03 |
17 | 38 | MA1961.1 (PATZ1) | MEME-5 | GGGRGGCGGRGGCGG | GGGRGGCGGRGGCGG | PATZ1 | 17 | 1 | 9820 | 22.1 | 10400 | 24.5 | 0.902 |
DNR_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DNR_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DNR_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
dplyr::filter(EVALUE.y < 0.05) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
# slice_head(n=7) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DNR_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
MTX_24_20kb_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_24_20kb_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
MTX_24_20kb_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_24_20kb_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
MTX_24_20kb_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_24_20kb_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
MTX_24_20kb_sea_all <- rbind(MTX_24_20kb_sea_disc, MTX_24_20kb_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
MTX_24_20kb_combo <- MTX_24_20kb_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., MTX_24_20kb_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
MTX_24_20kb_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA1513.1 (KLF15) | STREME-1 | 1-GCCCCGCCC | GCCCCGCCC | KLF15 | 8.57e-69 | 5.97e-69 | 1450 | 10.4 | 4390 | 6.02 | 1.73 |
2 | 2 | MA0060.3 (NFYA) | STREME-3 | 3-CYGATTGGYYRSN | CYGATTGGYYRSN | NFYA | 2.14e-56 | 7.46e-57 | 1030 | 7.45 | 2980 | 4.09 | 1.82 |
3 | 1 | MA0516.3 | SP2 | MA0516.3 | GGGGCGGGG | SP2 | 6.53e-55 | 1.52e-55 | 2080 | 15 | 7400 | 10.2 | 1.47 |
4 | 3 | 9-CCGVSBCGG | STREME-9 | 9-CCGVSBCGG | CCGVSBCGG | CCGVSBCGG | 1.53e-52 | 2.67e-53 | 2950 | 21.2 | 11400 | 15.7 | 1.35 |
5 | 4 | MA0079.5 (SP1) | MEME-3 | CSCCGCCSCSSCCSC | CSCCGCCSCSSCCSC | SP1 | 6.19e-52 | 8.63e-53 | 2720 | 19.6 | 10400 | 14.2 | 1.37 |
6 | 1 | MA1513.1 | KLF15 | MA1513.1 | SCCCCGCCCCS | KLF15 | 8.57e-51 | 9.96e-52 | 1630 | 11.7 | 5550 | 7.61 | 1.54 |
7 | 2 | MA0502.2 | NFYB | MA0502.2 | CYCATTGGCCVV | NFYB | 1.05e-49 | 1.04e-50 | 943 | 6.79 | 2740 | 3.76 | 1.81 |
8 | 1 | MA1564.1 | SP9 | MA1564.1 | RCCACGCCCMCY | SP9 | 1.39e-46 | 1.21e-47 | 3150 | 22.7 | 12600 | 17.3 | 1.31 |
9 | 1 | MA0685.2 | SP4 | MA0685.2 | GGGGCGGGG | SP4 | 1.83e-46 | 1.42e-47 | 1110 | 7.98 | 3470 | 4.76 | 1.68 |
10 | 1 | MA0746.2 | SP3 | MA0746.2 | NGCCACGCCCMCH | SP3 | 2.38e-46 | 1.66e-47 | 1860 | 13.4 | 6710 | 9.21 | 1.46 |
11 | 5 | MA0733.1 (EGR4) | STREME-4 | 4-CCGCCCGCG | CCGCCCGCG | EGR4 | 1.11e-45 | 7.01e-47 | 3160 | 22.8 | 12700 | 17.4 | 1.31 |
12 | 6 | MA0476.1 (FOS) | STREME-2 | 2-GATGASTCA | GATGASTCA | FOS | 8.95e-45 | 5.2e-46 | 1570 | 11.3 | 5480 | 7.52 | 1.51 |
13 | 6 | MA1128.1 | FOSL1::JUN | MA1128.1 | NKATGACTCATNN | FOSL1::JUN | 1.46e-44 | 7.83e-46 | 1820 | 13.1 | 6580 | 9.03 | 1.46 |
14 | 6 | MA0477.2 | FOSL1 | MA0477.2 | NNATGACTCATNN | FOSL1 | 4.16e-44 | 2.07e-45 | 1810 | 13 | 6530 | 8.96 | 1.46 |
15 | 6 | MA1928.1 | BNC2 | MA1928.1 | NDRTGAGTCABN | BNC2 | 1.52e-43 | 7.06e-45 | 2060 | 14.9 | 7680 | 10.6 | 1.41 |
16 | 1 | MA1961.1 | PATZ1 | MA1961.1 | SSGGGGMGGGGS | PATZ1 | 3.3e-43 | 1.44e-44 | 2020 | 14.6 | 7520 | 10.3 | 1.41 |
17 | 6 | MA0099.3 | FOS::JUN | MA0099.3 | ATGAGTCAYM | FOS::JUN | 5e-43 | 2.05e-44 | 1680 | 12.1 | 5990 | 8.22 | 1.47 |
18 | 6 | MA1633.2 | BACH1 | MA1633.2 | ATGACTCAT | BACH1 | 6.87e-42 | 2.66e-43 | 2650 | 19.1 | 10500 | 14.3 | 1.33 |
19 | 6 | MA1141.1 | FOS::JUND | MA1141.1 | NKATGAGTCATNN | FOS::JUND | 9.53e-42 | 3.5e-43 | 1710 | 12.3 | 6140 | 8.43 | 1.46 |
20 | 6 | MA1137.1 | FOSL1::JUNB | MA1137.1 | NDRTGACTCATNN | FOSL1::JUNB | 6.5e-41 | 2.27e-42 | 2740 | 19.7 | 10900 | 14.9 | 1.32 |
21 | 6 | MA1135.1 | FOSB::JUNB | MA1135.1 | KRTGASTCAT | FOSB::JUNB | 7.56e-41 | 2.51e-42 | 1760 | 12.7 | 6400 | 8.79 | 1.44 |
22 | 1 | MA0740.2 | KLF14 | MA0740.2 | KGGGCGGGG | KLF14 | 9.17e-40 | 2.91e-41 | 1180 | 8.52 | 3930 | 5.39 | 1.58 |
23 | 1 | MA1653.1 | ZNF148 | MA1653.1 | YBCCCCTCCCCC | ZNF148 | 1.22e-39 | 3.69e-41 | 3330 | 24 | 13700 | 18.8 | 1.27 |
24 | 1 | MA1959.1 | KLF7 | MA1959.1 | DGGGCGGGG | KLF7 | 1.78e-39 | 5.16e-41 | 1470 | 10.6 | 5150 | 7.07 | 1.49 |
25 | 1 | MA0742.2 | KLF12 | MA0742.2 | GGGGCGGGG | KLF12 | 3.3e-39 | 9.2e-41 | 986 | 7.1 | 3120 | 4.28 | 1.66 |
26 | 6 | MA0491.2 | JUND | MA0491.2 | NNATGACTCATNN | JUND | 3.59e-39 | 9.64e-41 | 2700 | 19.4 | 10800 | 14.8 | 1.32 |
27 | 6 | MA0462.2 | BATF::JUN | MA0462.2 | DATGACTCATH | BATF::JUN | 9.37e-39 | 2.42e-40 | 2170 | 15.6 | 8310 | 11.4 | 1.37 |
28 | 6 | MA1634.1 | BATF | MA1634.1 | DATGACTCATH | BATF | 3.32e-37 | 8.06e-39 | 2100 | 15.2 | 8080 | 11.1 | 1.37 |
29 | 6 | MA0478.1 | FOSL2 | MA0478.1 | KRRTGASTCAB | FOSL2 | 3.35e-37 | 8.06e-39 | 1610 | 11.6 | 5830 | 8.01 | 1.45 |
30 | 6 | MA1130.1 | FOSL2::JUN | MA1130.1 | NNRTGAGTCAYN | FOSL2::JUN | 5.03e-37 | 1.17e-38 | 1490 | 10.7 | 5340 | 7.32 | 1.47 |
31 | 6 | MA0835.2 | BATF3 | MA0835.2 | DATGACTCATH | BATF3 | 8.8e-37 | 1.98e-38 | 2070 | 14.9 | 7950 | 10.9 | 1.37 |
32 | 1 | MA0741.1 | KLF16 | MA0741.1 | GMCACGCCCCC | KLF16 | 2.5e-36 | 5.45e-38 | 2900 | 20.9 | 11800 | 16.2 | 1.29 |
33 | 1 | MA1511.2 | KLF10 | MA1511.2 | GGGGCGGGG | KLF10 | 3.84e-36 | 8.11e-38 | 1110 | 8 | 3720 | 5.1 | 1.57 |
34 | 7 | MA1522.1 | MAZ | MA1522.1 | BBCCCCTCCCY | MAZ | 2.06e-35 | 4.23e-37 | 3090 | 22.3 | 12800 | 17.6 | 1.27 |
35 | 6 | MA1132.1 | JUN::JUNB | MA1132.1 | KATGACKCAT | JUN::JUNB | 8.32e-35 | 1.66e-36 | 2560 | 18.4 | 10300 | 14.1 | 1.31 |
36 | 8 | MA1982.1 (ZNF574) | STREME-5 | 5-SGGCCS | SGGCCS | ZNF574 | 1.16e-34 | 2.24e-36 | 3100 | 22.3 | 12800 | 17.6 | 1.27 |
37 | 2 | MA0060.3 | NFYA | MA0060.3 | RRCCAATCAGM | NFYA | 4.31e-34 | 8.12e-36 | 490 | 3.53 | 1270 | 1.74 | 2.03 |
38 | 6 | MA0490.2 | JUNB | MA0490.2 | NNATGACTCATNN | JUNB | 4.4e-33 | 8.06e-35 | 2830 | 20.4 | 11600 | 16 | 1.28 |
39 | 1 | MA1512.1 | KLF11 | MA1512.1 | SCCACGCCCMC | KLF11 | 6.74e-33 | 1.21e-34 | 5160 | 37.1 | 23100 | 31.7 | 1.17 |
40 | 9 | MA1122.1 | TFDP1 | MA1122.1 | VSGCGGGAAVN | TFDP1 | 1.2e-32 | 2.09e-34 | 3400 | 24.5 | 14400 | 19.7 | 1.24 |
41 | 7 | MA1713.1 | ZNF610 | MA1713.1 | SSCGCCGCTCCSSS | ZNF610 | 1.81e-32 | 3.07e-34 | 2330 | 16.8 | 9300 | 12.8 | 1.31 |
42 | 6 | MA1101.2 | BACH2 | MA1101.2 | DWANCATGASTCATSNTWH | BACH2 | 3.56e-32 | 5.91e-34 | 2580 | 18.6 | 10500 | 14.4 | 1.29 |
43 | 1 | MA0079.5 | SP1 | MA0079.5 | GGGGCGGGG | SP1 | 5.09e-32 | 8.25e-34 | 692 | 4.98 | 2080 | 2.86 | 1.75 |
44 | 10 | MA0162.4 | EGR1 | MA0162.4 | VCMCGCCCACGCVS | EGR1 | 1.14e-31 | 1.8e-33 | 4140 | 29.8 | 18100 | 24.8 | 1.2 |
45 | 11 | 14-MGGSCCK | STREME-14 | 14-MGGSCCK | 14-MGGSCCK | MGGSCCK | 2.45e-31 | 3.8e-33 | 8030 | 57.8 | 38000 | 52.2 | 1.11 |
46 | 6 | MA0476.1 | FOS | MA0476.1 | DVTGASTCATB | FOS | 3.57e-31 | 5.41e-33 | 1110 | 7.99 | 3840 | 5.27 | 1.52 |
47 | 2 | MA1644.1 | NFYC | MA1644.1 | RDCCAATCASN | NFYC | 1.84e-30 | 2.73e-32 | 566 | 4.08 | 1620 | 2.22 | 1.84 |
48 | 1 | MA0493.2 | KLF1 | MA0493.2 | DGGGYGGGG | KLF1 | 3.49e-30 | 5.07e-32 | 1960 | 14.1 | 7700 | 10.6 | 1.34 |
49 | 12 | MA1712.1 | ZNF454 | MA1712.1 | DGGCTCSSGGCCCYGVKG | ZNF454 | 1.76e-29 | 2.5e-31 | 5370 | 38.7 | 24400 | 33.4 | 1.16 |
50 | 13 | MA0146.2 | Zfx | MA0146.2 | SSSGCCBVGGCCTS | Zfx | 1.84e-29 | 2.56e-31 | 6220 | 44.8 | 28700 | 39.4 | 1.14 |
51 | 10 | MA1107.2 | KLF9 | MA1107.2 | MNGCCACACCCACHYV | KLF9 | 1.01e-28 | 1.39e-30 | 5580 | 40.2 | 25500 | 34.9 | 1.15 |
52 | 14 | MA0506.2 | Nrf1 | MA0506.2 | SNCTGCGCMTGCGCV | Nrf1 | 1.28e-28 | 1.72e-30 | 4810 | 34.6 | 21600 | 29.6 | 1.17 |
53 | 1 | MA0039.4 | KLF4 | MA0039.4 | NNCCCCACCCHN | KLF4 | 1.09e-27 | 1.43e-29 | 2320 | 16.7 | 9450 | 13 | 1.29 |
54 | 2 | MA0502.2 (NFYB) | STREME-8 | 8-CCATTGGC | CCATTGGC | NFYB | 1.32e-27 | 1.71e-29 | 371 | 2.67 | 929 | 1.27 | 2.1 |
55 | 15 | MA0814.2 | TFAP2C | MA0814.2 | NNGCCTGAGGCGNN | TFAP2C | 6.84e-27 | 8.67e-29 | 5850 | 42.1 | 27000 | 37 | 1.14 |
56 | 1 | MA0599.1 | KLF5 | MA0599.1 | GCCCCDCCCH | KLF5 | 1.16e-26 | 1.44e-28 | 528 | 3.8 | 1540 | 2.11 | 1.81 |
57 | 6 | MA0489.2 | Jun | MA0489.2 | NVTGACTCATNN | Jun | 3.53e-26 | 4.31e-28 | 1020 | 7.35 | 3590 | 4.93 | 1.49 |
58 | 16 | MA1986.1 | ZNF692 | MA1986.1 | BGGGCCCASN | ZNF692 | 1.51e-24 | 1.81e-26 | 4380 | 31.6 | 19700 | 27 | 1.17 |
59 | 4 | MA1721.1 | ZNF93 | MA1721.1 | GGYAGCRGCAGCGGYG | ZNF93 | 2.01e-24 | 2.38e-26 | 2520 | 18.1 | 10500 | 14.5 | 1.25 |
60 | 10 | MA0733.1 | EGR4 | MA0733.1 | NHMCGCCCACGCAHWT | EGR4 | 2.4e-24 | 2.79e-26 | 3440 | 24.7 | 15000 | 20.6 | 1.2 |
61 | 1 | MA1517.1 | KLF6 | MA1517.1 | NRCCACGCCCH | KLF6 | 4.95e-24 | 5.66e-26 | 6640 | 47.8 | 31300 | 42.9 | 1.11 |
62 | 10 | MA0732.1 | EGR3 | MA0732.1 | HHMCGCCCACGCAHH | EGR3 | 9.87e-24 | 1.11e-25 | 2620 | 18.9 | 11100 | 15.2 | 1.24 |
63 | 17 | MA1599.1 | ZNF682 | MA1599.1 | BVGGCYAAGCCCCWNY | ZNF682 | 2.57e-23 | 2.85e-25 | 2580 | 18.6 | 10900 | 15 | 1.24 |
64 | 10 | MA0472.2 | EGR2 | MA0472.2 | MCGCCCACGCA | EGR2 | 2.67e-23 | 2.91e-25 | 2730 | 19.7 | 11700 | 16 | 1.23 |
65 | 6 | MA1142.1 | FOSL1::JUND | MA1142.1 | NRTGACTCAT | FOSL1::JUND | 7.05e-23 | 7.56e-25 | 1160 | 8.32 | 4290 | 5.88 | 1.41 |
66 | 18 | 13-CACCSCG | STREME-13 | 13-CACCSCG | 13-CACCSCG | CACCSCG | 1.27e-22 | 1.34e-24 | 1780 | 12.8 | 7130 | 9.78 | 1.31 |
67 | 19 | MA1548.1 | PLAGL2 | MA1548.1 | NGGGCCCCCN | PLAGL2 | 2.85e-22 | 2.96e-24 | 6880 | 49.5 | 32600 | 44.8 | 1.11 |
68 | 1 | MA1515.1 | KLF2 | MA1515.1 | NRCCACRCCCH | KLF2 | 6.95e-22 | 7.12e-24 | 1580 | 11.4 | 6270 | 8.61 | 1.33 |
69 | 10 | MA1650.1 | ZBTB14 | MA1650.1 | SSCCGCGCACVS | ZBTB14 | 1.59e-21 | 1.6e-23 | 2010 | 14.5 | 8290 | 11.4 | 1.27 |
70 | 20 | MA0748.2 | YY2 | MA0748.2 | NVATGGCGGCS | YY2 | 2.04e-20 | 2.03e-22 | 1200 | 8.67 | 4590 | 6.3 | 1.38 |
71 | 1 | MA0747.1 | SP8 | MA0747.1 | RCCACGCCCMCY | SP8 | 3.93e-20 | 3.86e-22 | 1500 | 10.8 | 5940 | 8.15 | 1.32 |
72 | 9 | MA0865.2 | E2F8 | MA0865.2 | TTCCCGCCAHWA | E2F8 | 1.67e-19 | 1.62e-21 | 4420 | 31.9 | 20200 | 27.8 | 1.15 |
73 | 21 | MA1976.1 | ZNF320 | MA1976.1 | DGKGGGRCCMRGGGGCCWGTGHVNNNN | ZNF320 | 1.92e-19 | 1.84e-21 | 3690 | 26.6 | 16600 | 22.7 | 1.17 |
74 | 1 | MA0753.2 | ZNF740 | MA0753.2 | CYRCCCCCCCCAC | ZNF740 | 6.54e-19 | 6.16e-21 | 5250 | 37.8 | 24500 | 33.6 | 1.13 |
75 | 22 | MA0774.1 (MEIS2) | STREME-12 | 12-CCTGTCAGC | 12-CCTGTCAGC | MEIS2 | 1.01e-18 | 9.42e-21 | 1340 | 9.62 | 5260 | 7.22 | 1.33 |
76 | 23 | MA0632.2 | TCFL5 | MA0632.2 | KCACGCGCMC | TCFL5 | 1.04e-18 | 9.5e-21 | 973 | 7.01 | 3620 | 4.96 | 1.41 |
77 | 6 | MA1144.1 | FOSL2::JUND | MA1144.1 | KATGACTCAT | FOSL2::JUND | 5.78e-18 | 5.23e-20 | 743 | 5.35 | 2630 | 3.61 | 1.48 |
78 | 6 | MA0841.1 | NFE2 | MA0841.1 | VATGACTCATS | NFE2 | 9.31e-18 | 8.32e-20 | 763 | 5.49 | 2720 | 3.74 | 1.47 |
79 | 24 | MA0014.3 | PAX5 | MA0014.3 | RRGCGTGACCNN | PAX5 | 8.3e-17 | 7.32e-19 | 2650 | 19.1 | 11600 | 15.9 | 1.2 |
80 | 9 | MA0471.2 | E2F6 | MA0471.2 | VVDGGCGGGAAVV | E2F6 | 8.52e-17 | 7.42e-19 | 3190 | 23 | 14300 | 19.6 | 1.17 |
81 | 25 | MA1573.2 (Thap11) | STREME-6 | 6-ACTACAACTCCC | ACTACAACTCCC | Thap11 | 9.98e-17 | 8.59e-19 | 53 | 0.38 | 39 | 0.05 | 7.08 |
82 | 6 | MA1138.1 | FOSL2::JUNB | MA1138.1 | KRTGASTCAT | FOSL2::JUNB | 3.59e-16 | 3.05e-18 | 700 | 5.04 | 2500 | 3.43 | 1.47 |
83 | 6 | MA0496.3 | MAFK | MA0496.3 | NGMTGACTCAGCMNH | MAFK | 4.56e-16 | 3.83e-18 | 2250 | 16.2 | 9710 | 13.3 | 1.22 |
84 | 6 | MA0655.1 | JDP2 | MA0655.1 | ATGACTCAT | JDP2 | 4.81e-16 | 3.99e-18 | 720 | 5.19 | 2590 | 3.55 | 1.46 |
85 | 6 | MA0089.2 | MAFG::NFE2L1 | MA0089.2 | NVNATGACTCAGCADW | MAFG::NFE2L1 | 5.45e-16 | 4.47e-18 | 2100 | 15.2 | 9020 | 12.4 | 1.23 |
86 | 26 | MA1985.1 | ZNF669 | MA1985.1 | GGGGYGAYGACCRCT | ZNF669 | 5.6e-16 | 4.54e-18 | 5410 | 39 | 25500 | 35 | 1.11 |
87 | 27 | MA1615.1 | Plagl1 | MA1615.1 | NNCTGGGGCCABN | Plagl1 | 1.14e-15 | 9.15e-18 | 7150 | 51.5 | 34600 | 47.5 | 1.09 |
88 | 28 | MA0131.2 | HINFP | MA0131.2 | CARCGTCCGCKN | HINFP | 7.64e-15 | 6.05e-17 | 2000 | 14.4 | 8570 | 11.8 | 1.22 |
89 | 29 | MA0810.1 | TFAP2A | MA0810.1 | YGCCCBVRGGCR | TFAP2A | 1.48e-14 | 1.16e-16 | 6540 | 47.1 | 31500 | 43.2 | 1.09 |
90 | 29 | MA0811.1 | TFAP2B | MA0811.1 | YGCCCBVRGGCA | TFAP2B | 2.83e-14 | 2.19e-16 | 6440 | 46.3 | 31000 | 42.5 | 1.09 |
91 | 23 | MA1099.2 | HES1 | MA1099.2 | GGCRCGTGBC | HES1 | 8.93e-14 | 6.84e-16 | 1050 | 7.56 | 4150 | 5.7 | 1.33 |
92 | 29 | MA0872.1 | TFAP2A | MA0872.1 | TGCCCYSRGGGCA | TFAP2A | 2.79e-13 | 2.11e-15 | 3770 | 27.2 | 17400 | 23.9 | 1.14 |
93 | 1 | MA1630.2 | ZNF281 | MA1630.2 | GGGGGAGGGGVV | ZNF281 | 2.81e-13 | 2.11e-15 | 713 | 5.13 | 2650 | 3.63 | 1.41 |
94 | 29 | MA0524.2 | TFAP2C | MA0524.2 | YGCCYBVRGGCA | TFAP2C | 3.31e-13 | 2.45e-15 | 6060 | 43.7 | 29100 | 40 | 1.09 |
95 | 30 | MA0464.2 | BHLHE40 | MA0464.2 | VKCACGTGMC | BHLHE40 | 8.66e-13 | 6.36e-15 | 5240 | 37.7 | 24900 | 34.2 | 1.1 |
96 | 25 | MA1573.2 | Thap11 | MA1573.2 | ACTACAABTCCCAG | Thap11 | 3.46e-12 | 2.52e-14 | 196 | 1.41 | 516 | 0.71 | 2 |
97 | 6 | MA0501.1 | MAF::NFE2 | MA0501.1 | ATGACTCAGCANWTY | MAF::NFE2 | 4.34e-12 | 3.12e-14 | 1750 | 12.6 | 7520 | 10.3 | 1.22 |
98 | 31 | 10-ACATGCACACACGT | STREME-10 | 10-ACATGCACACACGT | 10-ACATGCACACACGT | ACATGCACACACGT | 4.92e-12 | 3.5e-14 | 102 | 0.73 | 194 | 0.27 | 2.77 |
99 | 32 | MA1596.1 | ZNF460 | MA1596.1 | GCCTCMGCCTCCCRAG | ZNF460 | 7.58e-12 | 5.34e-14 | 3450 | 24.9 | 15900 | 21.9 | 1.14 |
100 | 6 | MA1134.1 | FOS::JUNB | MA1134.1 | KATGASTCATHN | FOS::JUNB | 8.03e-12 | 5.6e-14 | 483 | 3.48 | 1690 | 2.32 | 1.5 |
101 | 29 | MA0813.1 | TFAP2B | MA0813.1 | TGCCCYBRGGGCA | TFAP2B | 8.33e-12 | 5.75e-14 | 3280 | 23.6 | 15100 | 20.7 | 1.14 |
102 | 33 | MA0527.1 | ZBTB33 | MA0527.1 | BTCTCGCGAGABYTS | ZBTB33 | 1e-11 | 6.84e-14 | 832 | 5.99 | 3240 | 4.45 | 1.35 |
103 | 34 | MA1108.2 (MXI1) | STREME-11 | 11-ACACACATGTGCA | 11-ACACACATGTGCA | MXI1 | 1.48e-11 | 1e-13 | 80 | 0.58 | 132 | 0.18 | 3.2 |
104 | 35 | MA1643.1 | NFIB | MA1643.1 | NNCCTGGCAYNGTGCCAAGNN | NFIB | 2.69e-11 | 1.81e-13 | 4960 | 35.7 | 23600 | 32.4 | 1.1 |
105 | 6 | MA1988.1 | Atf3 | MA1988.1 | NRTGACTCABN | Atf3 | 4.23e-11 | 2.81e-13 | 495 | 3.56 | 1760 | 2.42 | 1.48 |
106 | 10 | MA1627.1 | Wt1 | MA1627.1 | YBCCTCCCCCACVB | Wt1 | 5.31e-11 | 3.49e-13 | 12100 | 87.2 | 61800 | 84.8 | 1.03 |
107 | 6 | MA0659.3 | Mafg | MA0659.3 | NWGMTGACTCAGCAN | Mafg | 1.66e-10 | 1.08e-12 | 4810 | 34.6 | 22900 | 31.5 | 1.1 |
108 | 29 | MA0815.1 | TFAP2C | MA0815.1 | TGCCCYSRGGGCA | TFAP2C | 9.26e-10 | 5.98e-12 | 1510 | 10.9 | 6520 | 8.95 | 1.22 |
109 | 15 | MA0812.1 | TFAP2B | MA0812.1 | HGCCTSAGGCD | TFAP2B | 1.25e-09 | 8.02e-12 | 8700 | 62.7 | 43400 | 59.5 | 1.05 |
110 | 36 | MA1514.1 | KLF17 | MA1514.1 | MMCCACGCACCCMTY | KLF17 | 1.61e-09 | 1.02e-11 | 4900 | 35.3 | 23500 | 32.2 | 1.09 |
111 | 30 | MA0616.2 | HES2 | MA0616.2 | GGCACGTGYC | HES2 | 2.27e-09 | 1.42e-11 | 1170 | 8.43 | 4920 | 6.75 | 1.25 |
112 | 15 | MA0003.4 | TFAP2A | MA0003.4 | NNKGCCTCAGGCNN | TFAP2A | 2.68e-09 | 1.67e-11 | 11800 | 85.2 | 60400 | 82.9 | 1.03 |
113 | 6 | MA1114.1 | PBX3 | MA1114.1 | NNNTGAGTGACAGGBNN | PBX3 | 2.71e-09 | 1.67e-11 | 9400 | 67.7 | 47100 | 64.7 | 1.05 |
114 | 30 | MA0608.1 | Creb3l2 | MA0608.1 | GCCACGTGD | Creb3l2 | 2.99e-09 | 1.83e-11 | 4460 | 32.2 | 21300 | 29.2 | 1.1 |
115 | 30 | MA1631.1 | ASCL1 | MA1631.1 | NNGCACCTGCYNB | ASCL1 | 3.33e-09 | 2.02e-11 | 11400 | 82.3 | 58200 | 79.8 | 1.03 |
116 | 31 | MA1155.1 (ZSCAN4) | STREME-7 | 7-GTGTGTGCATGYGTG | GTGTGTGCATGYGTG | ZSCAN4 | 3.5e-09 | 2.1e-11 | 96 | 0.69 | 203 | 0.28 | 2.49 |
117 | 37 | MA0734.3 | Gli2 | MA0734.3 | NRGACCACCCASV | Gli2 | 8.06e-09 | 4.8e-11 | 9050 | 65.2 | 45300 | 62.2 | 1.05 |
118 | 10 | MA1656.1 | ZNF449 | MA1656.1 | NNAAGCCCAACCNN | ZNF449 | 1.7e-08 | 1e-10 | 7680 | 55.3 | 38100 | 52.3 | 1.06 |
119 | 27 | MA1719.1 | ZNF816 | MA1719.1 | DRRKGGGGACATGCHGGG | ZNF816 | 3.43e-08 | 2.01e-10 | 8540 | 61.5 | 42700 | 58.6 | 1.05 |
120 | 15 | MA1569.1 | TFAP2E | MA1569.1 | YGCCYSRGGCD | TFAP2E | 3.83e-08 | 2.22e-10 | 1020 | 7.37 | 4300 | 5.9 | 1.25 |
121 | 30 | MA0830.2 | TCF4 | MA0830.2 | NNGCACCTGCCNN | TCF4 | 4.89e-08 | 2.82e-10 | 10700 | 76.8 | 54100 | 74.2 | 1.03 |
122 | 38 | MA1557.1 | SMAD5 | MA1557.1 | YGTCTAGACA | SMAD5 | 5.21e-08 | 2.98e-10 | 4150 | 29.9 | 19800 | 27.2 | 1.1 |
123 | 22 | MA0774.1 | MEIS2 | MA0774.1 | TTGACAGS | MEIS2 | 7.56e-08 | 4.29e-10 | 2670 | 19.2 | 12400 | 17 | 1.13 |
124 | 39 | MA0864.2 | E2F2 | MA0864.2 | RWTTTGGCGCCAWWWY | E2F2 | 8.03e-08 | 4.51e-10 | 4360 | 31.4 | 20900 | 28.7 | 1.09 |
125 | 40 | MA1966.1 | TFAP4::ETV1 | MA1966.1 | RSCGGAAGCAGSTGKN | TFAP4::ETV1 | 9.07e-08 | 5.06e-10 | 6280 | 45.2 | 30800 | 42.3 | 1.07 |
126 | 40 | MA0076.2 | ELK4 | MA0076.2 | BCRCTTCCGGB | ELK4 | 9.93e-08 | 5.49e-10 | 367 | 2.64 | 1310 | 1.8 | 1.47 |
127 | 7 | MA0528.2 | ZNF263 | MA0528.2 | VNGGGGAGGASG | ZNF263 | 5.17e-07 | 2.84e-09 | 7320 | 52.8 | 36400 | 49.9 | 1.06 |
128 | 30 | MA0649.1 | HEY2 | MA0649.1 | GRCACGTGYC | HEY2 | 5.95e-07 | 3.24e-09 | 3550 | 25.6 | 16900 | 23.2 | 1.1 |
129 | 30 | MA0147.3 | MYC | MA0147.3 | NNCCACGTGCNB | MYC | 6.28e-07 | 3.4e-09 | 11100 | 80.1 | 56700 | 77.8 | 1.03 |
130 | 24 | MA0067.2 | PAX2 | MA0067.2 | BCAVTSRAGCGTGACGR | PAX2 | 7.94e-07 | 4.26e-09 | 4330 | 31.2 | 20900 | 28.7 | 1.09 |
131 | 41 | MA0767.1 | GCM2 | MA0767.1 | BATGCGGGTR | GCM2 | 8.71e-07 | 4.6e-09 | 5700 | 41 | 27900 | 38.3 | 1.07 |
132 | 39 | MA1727.1 | ZNF417 | MA1727.1 | VRBVNTGGGCGCCAM | ZNF417 | 8.71e-07 | 4.6e-09 | 3370 | 24.3 | 16000 | 21.9 | 1.11 |
133 | 30 | MA0821.2 | HES5 | MA0821.2 | GRCACGTGYC | HES5 | 9.4e-07 | 4.93e-09 | 722 | 5.2 | 2950 | 4.05 | 1.29 |
134 | 40 | MA0764.3 | ETV4 | MA0764.3 | ACCGGAAGTR | ETV4 | 1.06e-06 | 5.53e-09 | 246 | 1.77 | 822 | 1.13 | 1.57 |
135 | 30 | MA0823.1 | HEY1 | MA0823.1 | GRCACGTGCC | HEY1 | 1.37e-06 | 7.08e-09 | 1290 | 9.28 | 5650 | 7.76 | 1.2 |
136 | 30 | MA0603.1 | Arntl | MA0603.1 | NGTCACGTGH | Arntl | 1.53e-06 | 7.85e-09 | 10500 | 75.8 | 53500 | 73.4 | 1.03 |
137 | 42 | MA1964.1 | SMAD2 | MA1964.1 | NBCCAGACDN | SMAD2 | 1.57e-06 | 8e-09 | 9780 | 70.4 | 49500 | 67.9 | 1.04 |
138 | 30 | MA1106.1 | HIF1A | MA1106.1 | SBACGTGCNN | HIF1A | 4.05e-06 | 2.05e-08 | 3140 | 22.6 | 14900 | 20.4 | 1.11 |
139 | 30 | MA1464.1 | ARNT2 | MA1464.1 | GTCACGTGMN | ARNT2 | 5.25e-06 | 2.63e-08 | 3400 | 24.5 | 16200 | 22.3 | 1.1 |
140 | 43 | MA1684.1 | Foxn1 | MA1684.1 | RGAMGC | Foxn1 | 5.72e-06 | 2.85e-08 | 2230 | 16 | 10300 | 14.2 | 1.13 |
141 | 30 | MA1474.1 | CREB3L4 | MA1474.1 | YGCCACGTCAYC | CREB3L4 | 5.99e-06 | 2.96e-08 | 9250 | 66.6 | 46700 | 64.1 | 1.04 |
142 | 7 | MA1578.1 | VEZF1 | MA1578.1 | CCCCCCMYDH | VEZF1 | 8.96e-06 | 4.4e-08 | 1960 | 14.2 | 9030 | 12.4 | 1.14 |
143 | 44 | MA0028.2 | ELK1 | MA0028.2 | ACCGGAAGTR | ELK1 | 1.13e-05 | 5.51e-08 | 182 | 1.31 | 582 | 0.8 | 1.65 |
144 | 39 | MA0024.3 | E2F1 | MA0024.3 | TTTGGCGCCAAA | E2F1 | 1.15e-05 | 5.57e-08 | 3240 | 23.3 | 15400 | 21.2 | 1.1 |
145 | 6 | MA1521.1 | MAFA | MA1521.1 | NTGCTGASTCAGCAN | MAFA | 1.17e-05 | 5.6e-08 | 4410 | 31.7 | 21400 | 29.4 | 1.08 |
146 | 24 | MA1995.1 | Npas4 | MA1995.1 | NNTCGTGACHN | Npas4 | 2.58e-05 | 1.23e-07 | 4500 | 32.4 | 21900 | 30 | 1.08 |
147 | 44 | MA0765.3 | ETV5 | MA0765.3 | ACCGGAAGTR | ETV5 | 2.96e-05 | 1.4e-07 | 184 | 1.33 | 600 | 0.82 | 1.62 |
148 | 45 | MA1710.1 | ZNF257 | MA1710.1 | VDGAGGCRAGRG | ZNF257 | 4.4e-05 | 2.07e-07 | 4500 | 32.4 | 21900 | 30.1 | 1.08 |
149 | 46 | MA0597.2 | THAP1 | MA0597.2 | VSGCAGGGCASV | THAP1 | 5.45e-05 | 2.55e-07 | 10100 | 72.9 | 51500 | 70.7 | 1.03 |
150 | 39 | MA0470.2 | E2F4 | MA0470.2 | TTTTGGCGCCAWWW | E2F4 | 9.12e-05 | 4.24e-07 | 1950 | 14 | 9050 | 12.4 | 1.13 |
151 | 1 | MA1516.1 | KLF3 | MA1516.1 | GRCCRCGCCCH | KLF3 | 9.28e-05 | 4.29e-07 | 144 | 1.04 | 448 | 0.61 | 1.69 |
152 | 30 | MA0822.1 | HES7 | MA0822.1 | YGGCACGTGCCR | HES7 | 0.000115 | 5.28e-07 | 793 | 5.71 | 3400 | 4.66 | 1.23 |
153 | 40 | MA0633.2 | Twist2 | MA0633.2 | NVCAGCTGBN | Twist2 | 0.000141 | 6.41e-07 | 4020 | 28.9 | 19500 | 26.8 | 1.08 |
154 | 44 | MA0759.2 | ELK3 | MA0759.2 | ACCGGAAGTRV | ELK3 | 0.000166 | 7.51e-07 | 186 | 1.34 | 627 | 0.86 | 1.56 |
155 | 40 | MA1997.1 | Olig2 | MA1997.1 | NVCAGCTGBN | Olig2 | 0.00023 | 1.03e-06 | 7730 | 55.7 | 38900 | 53.4 | 1.04 |
156 | 30 | MA0844.1 | XBP1 | MA0844.1 | WDKGMCACGTCATC | XBP1 | 0.000261 | 1.17e-06 | 4620 | 33.2 | 22700 | 31.1 | 1.07 |
157 | 30 | MA1934.1 | ERF::FIGLA | MA1934.1 | RCCGGAARCASCTGBN | ERF::FIGLA | 0.000309 | 1.37e-06 | 2770 | 20 | 13200 | 18.2 | 1.1 |
158 | 37 | MA1990.1 | Gli1 | MA1990.1 | NNRGACCACCCASV | Gli1 | 0.000352 | 1.55e-06 | 5140 | 37 | 25400 | 34.8 | 1.06 |
159 | 47 | MA1982.1 | ZNF574 | MA1982.1 | VGSCTAGAGMGGCCS | ZNF574 | 0.000374 | 1.64e-06 | 7150 | 51.5 | 35800 | 49.2 | 1.05 |
160 | 40 | MA1941.1 | ETV2::FIGLA | MA1941.1 | RSCGGAARCAGSTGBN | ETV2::FIGLA | 0.000419 | 1.82e-06 | 5860 | 42.2 | 29100 | 40 | 1.06 |
161 | 48 | MA1730.1 | ZNF708 | MA1730.1 | NNRGCTGTRCCTCCY | ZNF708 | 0.000426 | 1.85e-06 | 7290 | 52.5 | 36600 | 50.2 | 1.05 |
162 | 22 | MA0775.1 | MEIS3 | MA0775.1 | DTGACAGS | MEIS3 | 0.000489 | 2.1e-06 | 6010 | 43.3 | 29900 | 41.1 | 1.05 |
163 | 30 | MA0692.1 | TFEB | MA0692.1 | RYCACGTGAC | TFEB | 0.000506 | 2.16e-06 | 11700 | 84.2 | 60100 | 82.5 | 1.02 |
164 | 6 | MA0782.2 | PKNOX1 | MA0782.2 | NNTGAGTGACAGVNN | PKNOX1 | 0.000525 | 2.23e-06 | 9390 | 67.6 | 47700 | 65.5 | 1.03 |
165 | 39 | MA0615.1 | Gmeb1 | MA0615.1 | BHBBKKACGTMMNWNNN | Gmeb1 | 0.000681 | 2.88e-06 | 635 | 4.57 | 2690 | 3.7 | 1.24 |
166 | 22 | MA0498.2 | MEIS1 | MA0498.2 | HTGACAD | MEIS1 | 0.000717 | 3.01e-06 | 7520 | 54.1 | 37800 | 51.9 | 1.04 |
167 | 30 | MA1493.1 | HES6 | MA1493.1 | GGCACGTGTY | HES6 | 0.000745 | 3.11e-06 | 3580 | 25.8 | 17400 | 23.9 | 1.08 |
168 | 30 | MA1560.1 | SOHLH2 | MA1560.1 | YGCACGTGCN | SOHLH2 | 0.000797 | 3.31e-06 | 269 | 1.94 | 1010 | 1.38 | 1.41 |
169 | 30 | MA0004.1 | Arnt | MA0004.1 | CACGTG | Arnt | 0.000897 | 3.7e-06 | 2180 | 15.7 | 10300 | 14.2 | 1.11 |
170 | 27 | MA1102.2 | CTCFL | MA1102.2 | NSCAGGGGGCGS | CTCFL | 0.00121 | 4.98e-06 | 6740 | 48.5 | 33800 | 46.4 | 1.05 |
171 | 30 | MA0499.2 | MYOD1 | MA0499.2 | NNGCACCTGTCNB | MYOD1 | 0.00126 | 5.15e-06 | 11300 | 81.6 | 58200 | 79.9 | 1.02 |
172 | 37 | MA0695.1 | ZBTB7C | MA0695.1 | RCGACCACCGAN | ZBTB7C | 0.00181 | 7.32e-06 | 1550 | 11.2 | 7180 | 9.85 | 1.13 |
173 | 44 | MA1933.1 | ELK1::SREBF2 | MA1933.1 | DCCGGAAGTSRCGTGA | ELK1::SREBF2 | 0.00246 | 9.92e-06 | 2010 | 14.5 | 9510 | 13 | 1.11 |
174 | 40 | MA0763.1 | ETV3 | MA0763.1 | ACCGGAAGTR | ETV3 | 0.00277 | 1.11e-05 | 239 | 1.72 | 893 | 1.23 | 1.41 |
175 | 44 | MA0156.3 | FEV | MA0156.3 | VACCGGAAGTVV | FEV | 0.00285 | 1.13e-05 | 238 | 1.71 | 889 | 1.22 | 1.41 |
176 | 49 | MA0155.1 | INSM1 | MA0155.1 | TGYCWGGGGGCR | INSM1 | 0.00321 | 1.27e-05 | 2430 | 17.5 | 11600 | 15.9 | 1.1 |
177 | 37 | MA0694.1 | ZBTB7B | MA0694.1 | RCGACCACCGAA | ZBTB7B | 0.00518 | 2.04e-05 | 3320 | 23.9 | 16200 | 22.2 | 1.08 |
178 | 21 | MA1723.1 | PRDM9 | MA1723.1 | RGDGGGVAGGGRGGVRRMARVARR | PRDM9 | 0.00551 | 2.16e-05 | 3830 | 27.6 | 18800 | 25.8 | 1.07 |
179 | 50 | MA1981.1 | ZNF530 | MA1981.1 | GMARGGMRAGGGGCRGSV | ZNF530 | 0.00557 | 2.17e-05 | 393 | 2.83 | 1610 | 2.2 | 1.29 |
180 | 21 | MA0163.1 | PLAG1 | MA0163.1 | GGGGCCCWAGGGGG | PLAG1 | 0.00771 | 2.99e-05 | 1360 | 9.82 | 6320 | 8.68 | 1.13 |
181 | 51 | MA1584.1 | ZIC5 | MA1584.1 | VGACCCCCCGCTGHGM | ZIC5 | 0.00788 | 3.03e-05 | 1420 | 10.2 | 6600 | 9.05 | 1.13 |
182 | 40 | MA0645.1 | ETV6 | MA0645.1 | MSCGGAAGTR | ETV6 | 0.0118 | 4.54e-05 | 151 | 1.09 | 529 | 0.73 | 1.5 |
183 | 15 | MA1581.1 | ZBTB6 | MA1581.1 | DNSCTTGAGCCNV | ZBTB6 | 0.0122 | 4.64e-05 | 4470 | 32.2 | 22100 | 30.4 | 1.06 |
184 | 40 | MA1484.1 | ETS2 | MA1484.1 | DACCGGAAGY | ETS2 | 0.0137 | 5.18e-05 | 215 | 1.55 | 811 | 1.11 | 1.4 |
185 | 30 | MA0058.3 | MAX | MA0058.3 | AVCACGTGNY | MAX | 0.0193 | 7.29e-05 | 8610 | 62 | 43800 | 60.1 | 1.03 |
186 | 52 | MA0863.1 | MTF1 | MA0863.1 | TTTGCACACGGCAC | MTF1 | 0.024 | 9.01e-05 | 1540 | 11 | 7220 | 9.91 | 1.12 |
187 | 15 | MA1629.1 | Zic2 | MA1629.1 | NDCACAGCAGGDRG | Zic2 | 0.0279 | 0.000104 | 12300 | 88.5 | 63600 | 87.3 | 1.01 |
188 | 30 | MA0103.3 | ZEB1 | MA0103.3 | SNCACCTGSVN | ZEB1 | 0.0279 | 0.000104 | 12500 | 90.3 | 65000 | 89.2 | 1.01 |
16 | 6 | MA1633.2 (BACH1) | MEME-2 | TGRCTCAYGCCTGTA | TGRCTCAYGCCTGTA | BACH1 | 15.4 | 0.961 | 168 | 1.21 | 959 | 1.32 | 0.924 |
17 | 53 | MA1125.1 (ZNF384) | MEME-1 | WAAAAAAAAAAAAAA | WAAAAAAAAAAAAAA | ZNF384 | 16.9 | 0.995 | 22 | 0.16 | 159 | 0.22 | 0.754 |
18 | 54 | MA0114.4 (HNF4A) | MEME-4 | CCAAAGTGCTGGGAT | CCAAAGTGCTGGGAT | HNF4A | 18 | 1 | 542 | 3.9 | 3360 | 4.62 | 0.846 |
MTX_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("MTX_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
MTX_24_20kb_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
# slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*3), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./3,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("MTX_24_20kb_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DOX_3_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_3_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
DOX_3_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_3_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
DOX_3_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_3_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
DOX_3_DAR_DEG_sea_all <- rbind(DOX_3_DAR_DEG_sea_disc, DOX_3_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
DOX_3_DAR_DEG_combo <- DOX_3_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., DOX_3_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
DOX_3_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA0528.2 (ZNF263) | STREME-1 | 1-GAGCTGGGAGGAV | GAGCTGGGAGGAV | ZNF263 | 1.63e-08 | 1.23e-08 | 6 | 100 | 40 | 1.15 | 84.6 |
2 | 2 | MA0697.2 (Zic3) | STREME-2 | 2-CAGCAGGBGCGB | CAGCAGGBGCGB | Zic3 | 2.63e-08 | 1.23e-08 | 5 | 83.3 | 11 | 0.32 | 248 |
3 | 3 | MA0154.4 (EBF1) | STREME-3 | 3-CCCYCYAGGVGRRG | CCCYCYAGGVGRRG | EBF1 | 1.94e-05 | 6.02e-06 | 3 | 50 | 1 | 0.03 | 991 |
4 | 4 | MA1629.1 (Zic2) | STREME-4 | 4-CCGGCYGGGAHT | CCGGCYGGGAHT | Zic2 | 7.27e-05 | 1.69e-05 | 3 | 50 | 2 | 0.06 | 661 |
5 | 5 | MA1615.1 (Plagl1) | STREME-5 | 5-CTGGCCGCCG | CTGGCCGCCG | Plagl1 | 0.000124 | 2.31e-05 | 4 | 66.7 | 22 | 0.63 | 108 |
DOX_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DOX_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DOX_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*.05), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./.05,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DOX_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DOX_24_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_24_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
DOX_24_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_24_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
DOX_24_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DOX_24_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
DOX_24_DAR_DEG_sea_all <- rbind(DOX_24_DAR_DEG_sea_disc, DOX_24_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
DOX_24_DAR_DEG_combo <- DOX_24_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., DOX_24_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
DOX_24_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA0742.2 (KLF12) | STREME-2 | 2-CCCGYCCS | CCCGYCCS | KLF12 | 3.39e-32 | 2.56e-32 | 1890 | 11.4 | 3360 | 8.06 | 1.41 |
2 | 2 | MA1596.1 | ZNF460 | MA1596.1 | GCCTCMGCCTCCCRAG | ZNF460 | 5.12e-30 | 1.93e-30 | 5110 | 30.8 | 10800 | 25.9 | 1.19 |
3 | 3 | MA1973.1 (ZKSCAN3) | STREME-8 | 8-SCCCAGGCTGSSS | SCCCAGGCTGSSS | ZKSCAN3 | 2.86e-29 | 7.21e-30 | 6100 | 36.8 | 13200 | 31.7 | 1.16 |
4 | 4 | MA1713.1 | ZNF610 | MA1713.1 | SSCGCCGCTCCSSS | ZNF610 | 3.44e-26 | 6.49e-27 | 2450 | 14.8 | 4740 | 11.4 | 1.3 |
5 | 5 | MA1713.1 (ZNF610) | STREME-3 | 3-CCGCTCC | CCGCTCC | ZNF610 | 4.41e-26 | 6.66e-27 | 6030 | 36.3 | 13100 | 31.5 | 1.15 |
6 | 6 | MA1513.1 | KLF15 | MA1513.1 | SCCCCGCCCCS | KLF15 | 1.22e-24 | 1.54e-25 | 2510 | 15.1 | 4900 | 11.8 | 1.29 |
7 | 7 | MA0732.1 (EGR3) | STREME-6 | 6-GGCGSW | GGCGSW | EGR3 | 1.11e-23 | 1.19e-24 | 3540 | 21.3 | 7300 | 17.5 | 1.22 |
8 | 6 | MA1961.1 | PATZ1 | MA1961.1 | SSGGGGMGGGGS | PATZ1 | 1.11e-22 | 1.05e-23 | 2780 | 16.8 | 5580 | 13.4 | 1.25 |
9 | 8 | MA1712.1 (ZNF454) | STREME-4 | 4-SGGSCCS | SGGSCCS | ZNF454 | 3.48e-22 | 2.92e-23 | 2880 | 17.4 | 5820 | 14 | 1.24 |
10 | 4 | MA1522.1 | MAZ | MA1522.1 | BBCCCCTCCCY | MAZ | 6.74e-22 | 5.09e-23 | 5070 | 30.5 | 11000 | 26.3 | 1.16 |
11 | 9 | MA1650.1 | ZBTB14 | MA1650.1 | SSCCGCGCACVS | ZBTB14 | 4.96e-21 | 3.4e-22 | 3700 | 22.3 | 7760 | 18.6 | 1.2 |
12 | 6 | MA1564.1 | SP9 | MA1564.1 | RCCACGCCCMCY | SP9 | 1.23e-20 | 7.77e-22 | 5610 | 33.8 | 12300 | 29.6 | 1.14 |
13 | 6 | MA1630.2 | ZNF281 | MA1630.2 | GGGGGAGGGGVV | ZNF281 | 6.53e-20 | 3.79e-21 | 6330 | 38.1 | 14100 | 33.8 | 1.13 |
14 | 6 | MA0039.4 | KLF4 | MA0039.4 | NNCCCCACCCHN | KLF4 | 1.15e-19 | 6.22e-21 | 5050 | 30.4 | 11000 | 26.4 | 1.15 |
15 | 10 | 1-CCGGGCC | STREME-1 | 1-CCGGGCC | CCGGGCC | CCGGGCC | 2.24e-19 | 1.13e-20 | 2480 | 14.9 | 4970 | 11.9 | 1.25 |
16 | 11 | MA0810.1 | TFAP2A | MA0810.1 | YGCCCBVRGGCR | TFAP2A | 2.46e-19 | 1.16e-20 | 6530 | 39.4 | 14600 | 35.1 | 1.12 |
17 | 6 | MA1511.2 | KLF10 | MA1511.2 | GGGGCGGGG | KLF10 | 2.94e-19 | 1.3e-20 | 3100 | 18.6 | 6410 | 15.4 | 1.21 |
18 | 6 | MA1515.1 | KLF2 | MA1515.1 | NRCCACRCCCH | KLF2 | 3.14e-19 | 1.3e-20 | 5420 | 32.6 | 11900 | 28.6 | 1.14 |
19 | 12 | MA0162.4 | EGR1 | MA0162.4 | VCMCGCCCACGCVS | EGR1 | 3.27e-19 | 1.3e-20 | 4120 | 24.8 | 8820 | 21.1 | 1.17 |
20 | 6 | MA0742.2 | KLF12 | MA0742.2 | GGGGCGGGG | KLF12 | 1.13e-18 | 4.28e-20 | 1810 | 10.9 | 3490 | 8.35 | 1.3 |
21 | 6 | MA1517.1 | KLF6 | MA1517.1 | NRCCACGCCCH | KLF6 | 1.29e-18 | 4.65e-20 | 4430 | 26.7 | 9570 | 22.9 | 1.16 |
22 | 13 | MA1102.2 | CTCFL | MA1102.2 | NSCAGGGGGCGS | CTCFL | 1.38e-18 | 4.73e-20 | 2430 | 14.6 | 4900 | 11.7 | 1.25 |
23 | 14 | MA0814.2 | TFAP2C | MA0814.2 | NNGCCTGAGGCGNN | TFAP2C | 1.84e-18 | 5.73e-20 | 4270 | 25.7 | 9200 | 22 | 1.17 |
24 | 6 | MA0746.2 | SP3 | MA0746.2 | NGCCACGCCCMCH | SP3 | 1.87e-18 | 5.73e-20 | 5010 | 30.2 | 11000 | 26.3 | 1.15 |
25 | 6 | MA1516.1 | KLF3 | MA1516.1 | GRCCRCGCCCH | KLF3 | 1.9e-18 | 5.73e-20 | 4130 | 24.9 | 8860 | 21.2 | 1.17 |
26 | 2 | MA1965.1 | SP5 | MA1965.1 | CBCCTCCCHS | SP5 | 2.95e-18 | 8.28e-20 | 5610 | 33.8 | 12400 | 29.8 | 1.13 |
27 | 6 | MA0753.2 | ZNF740 | MA0753.2 | CYRCCCCCCCCAC | ZNF740 | 2.96e-18 | 8.28e-20 | 7490 | 45.1 | 17000 | 40.8 | 1.1 |
28 | 15 | MA1122.1 | TFDP1 | MA1122.1 | VSGCGGGAAVN | TFDP1 | 3.19e-18 | 8.62e-20 | 2590 | 15.6 | 5280 | 12.6 | 1.24 |
29 | 16 | MA0506.2 | Nrf1 | MA0506.2 | SNCTGCGCMTGCGCV | Nrf1 | 4.23e-17 | 1.07e-18 | 3850 | 23.2 | 8260 | 19.8 | 1.17 |
30 | 17 | MA1976.1 | ZNF320 | MA1976.1 | DGKGGGRCCMRGGGGCCWGTGHVNNNN | ZNF320 | 4.25e-17 | 1.07e-18 | 4660 | 28.1 | 10200 | 24.4 | 1.15 |
31 | 18 | MA1986.1 | ZNF692 | MA1986.1 | BGGGCCCASN | ZNF692 | 4.55e-17 | 1.11e-18 | 5850 | 35.2 | 13100 | 31.3 | 1.13 |
32 | 6 | MA1959.1 | KLF7 | MA1959.1 | DGGGCGGGG | KLF7 | 5.14e-17 | 1.21e-18 | 7360 | 44.3 | 16800 | 40.2 | 1.1 |
33 | 19 | MA1596.1 (ZNF460) | STREME-5 | 5-GCSTCC | GCSTCC | ZNF460 | 7.3e-17 | 1.67e-18 | 5240 | 31.6 | 11600 | 27.8 | 1.14 |
34 | 20 | MA1712.1 | ZNF454 | MA1712.1 | DGGCTCSSGGCCCYGVKG | ZNF454 | 3.69e-16 | 8.2e-18 | 3880 | 23.4 | 8360 | 20 | 1.17 |
35 | 12 | MA0732.1 | EGR3 | MA0732.1 | HHMCGCCCACGCAHH | EGR3 | 1.1e-15 | 2.37e-17 | 3010 | 18.1 | 6330 | 15.2 | 1.2 |
36 | 14 | MA0003.4 | TFAP2A | MA0003.4 | NNKGCCTCAGGCNN | TFAP2A | 1.47e-15 | 3.08e-17 | 5430 | 32.7 | 12100 | 29 | 1.13 |
37 | 21 | MA0024.3 | E2F1 | MA0024.3 | TTTGGCGCCAAA | E2F1 | 2.8e-15 | 5.72e-17 | 3270 | 19.7 | 6940 | 16.6 | 1.18 |
38 | 22 | MA0131.2 | HINFP | MA0131.2 | CARCGTCCGCKN | HINFP | 3.79e-15 | 7.53e-17 | 1880 | 11.4 | 3750 | 8.99 | 1.26 |
39 | 11 | MA0524.2 | TFAP2C | MA0524.2 | YGCCYBVRGGCA | TFAP2C | 5.42e-15 | 1.05e-16 | 7300 | 44 | 16700 | 40.1 | 1.1 |
40 | 6 | MA0079.5 | SP1 | MA0079.5 | GGGGCGGGG | SP1 | 6.16e-15 | 1.16e-16 | 1120 | 6.77 | 2070 | 4.95 | 1.37 |
41 | 15 | MA0865.2 | E2F8 | MA0865.2 | TTCCCGCCAHWA | E2F8 | 6.49e-15 | 1.18e-16 | 3880 | 23.4 | 8410 | 20.2 | 1.16 |
42 | 6 | MA0493.2 | KLF1 | MA0493.2 | DGGGYGGGG | KLF1 | 6.58e-15 | 1.18e-16 | 2660 | 16 | 5540 | 13.3 | 1.21 |
43 | 23 | MA0736.1 | GLIS2 | MA0736.1 | GACCCCCCGCRAMG | GLIS2 | 6.73e-15 | 1.18e-16 | 5260 | 31.7 | 11700 | 28.1 | 1.13 |
44 | 11 | MA0811.1 | TFAP2B | MA0811.1 | YGCCCBVRGGCA | TFAP2B | 1.04e-14 | 1.78e-16 | 5480 | 33 | 12300 | 29.4 | 1.12 |
45 | 6 | MA0741.1 | KLF16 | MA0741.1 | GMCACGCCCCC | KLF16 | 1.59e-14 | 2.67e-16 | 2560 | 15.4 | 5310 | 12.7 | 1.21 |
46 | 15 | MA1583.1 | ZFP57 | MA1583.1 | NNVTTGCCGCANN | ZFP57 | 1.83e-14 | 2.99e-16 | 1750 | 10.6 | 3470 | 8.32 | 1.27 |
47 | 6 | MA1512.1 | KLF11 | MA1512.1 | SCCACGCCCMC | KLF11 | 1.86e-14 | 2.99e-16 | 7410 | 44.6 | 17000 | 40.8 | 1.09 |
48 | 6 | MA0516.3 | SP2 | MA0516.3 | GGGGCGGGG | SP2 | 3.36e-14 | 5.28e-16 | 1150 | 6.93 | 2140 | 5.13 | 1.35 |
49 | 6 | MA0747.1 | SP8 | MA0747.1 | RCCACGCCCMCY | SP8 | 5.18e-14 | 7.99e-16 | 4910 | 29.6 | 10900 | 26.2 | 1.13 |
50 | 11 | MA0812.1 | TFAP2B | MA0812.1 | HGCCTSAGGCD | TFAP2B | 9.27e-14 | 1.4e-15 | 7960 | 48 | 18500 | 44.2 | 1.09 |
51 | 6 | MA0685.2 | SP4 | MA0685.2 | GGGGCGGGG | SP4 | 3.96e-13 | 5.87e-15 | 1140 | 6.88 | 2150 | 5.14 | 1.34 |
52 | 23 | MA0751.1 | ZIC4 | MA0751.1 | GRCCCCCCGCKGYGH | ZIC4 | 5.64e-13 | 8.2e-15 | 5790 | 34.9 | 13100 | 31.4 | 1.11 |
53 | 24 | MA0146.2 | Zfx | MA0146.2 | SSSGCCBVGGCCTS | Zfx | 1.4e-12 | 1.99e-14 | 2380 | 14.4 | 4970 | 11.9 | 1.2 |
54 | 6 | MA0740.2 | KLF14 | MA0740.2 | KGGGCGGGG | KLF14 | 1.49e-12 | 2.09e-14 | 2280 | 13.7 | 4730 | 11.3 | 1.21 |
55 | 25 | MA1985.1 | ZNF669 | MA1985.1 | GGGGYGAYGACCRCT | ZNF669 | 2.41e-12 | 3.3e-14 | 4390 | 26.4 | 9750 | 23.4 | 1.13 |
56 | 6 | MA0471.2 | E2F6 | MA0471.2 | VVDGGCGGGAAVV | E2F6 | 2.56e-12 | 3.46e-14 | 1780 | 10.7 | 3600 | 8.63 | 1.24 |
57 | 17 | MA1723.1 | PRDM9 | MA1723.1 | RGDGGGVAGGGRGGVRRMARVARR | PRDM9 | 2.9e-12 | 3.85e-14 | 7060 | 42.5 | 16300 | 39 | 1.09 |
58 | 6 | MA0599.1 | KLF5 | MA0599.1 | GCCCCDCCCH | KLF5 | 4.81e-12 | 6.27e-14 | 2180 | 13.2 | 4530 | 10.9 | 1.21 |
59 | 23 | MA1578.1 | VEZF1 | MA1578.1 | CCCCCCMYDH | VEZF1 | 5.09e-12 | 6.52e-14 | 8190 | 49.3 | 19100 | 45.8 | 1.08 |
60 | 26 | MA1721.1 | ZNF93 | MA1721.1 | GGYAGCRGCAGCGGYG | ZNF93 | 7.75e-12 | 9.76e-14 | 2830 | 17.1 | 6060 | 14.5 | 1.18 |
61 | 12 | MA1107.2 | KLF9 | MA1107.2 | MNGCCACACCCACHYV | KLF9 | 8.27e-12 | 1.02e-13 | 7160 | 43.1 | 16600 | 39.7 | 1.09 |
62 | 27 | MA1728.1 | ZNF549 | MA1728.1 | NNTGCTGCCCWR | ZNF549 | 9.99e-12 | 1.22e-13 | 6990 | 42.1 | 16100 | 38.6 | 1.09 |
63 | 28 | MA0694.1 | ZBTB7B | MA0694.1 | RCGACCACCGAA | ZBTB7B | 1.49e-11 | 1.79e-13 | 6780 | 40.8 | 15600 | 37.5 | 1.09 |
64 | 12 | MA0472.2 | EGR2 | MA0472.2 | MCGCCCACGCA | EGR2 | 1.8e-11 | 2.12e-13 | 3350 | 20.2 | 7300 | 17.5 | 1.15 |
65 | 29 | MA1514.1 | KLF17 | MA1514.1 | MMCCACGCACCCMTY | KLF17 | 2.38e-11 | 2.76e-13 | 8010 | 48.3 | 18700 | 44.8 | 1.08 |
66 | 30 | MA0014.3 | PAX5 | MA0014.3 | RRGCGTGACCNN | PAX5 | 2.71e-11 | 3.1e-13 | 4620 | 27.8 | 10300 | 24.8 | 1.12 |
67 | 31 | MA0748.2 | YY2 | MA0748.2 | NVATGGCGGCS | YY2 | 3.32e-11 | 3.74e-13 | 3730 | 22.5 | 8220 | 19.7 | 1.14 |
68 | 12 | MA1627.1 | Wt1 | MA1627.1 | YBCCTCCCCCACVB | Wt1 | 4.3e-11 | 4.78e-13 | 12500 | 75 | 30000 | 72 | 1.04 |
69 | 32 | MA1719.1 | ZNF816 | MA1719.1 | DRRKGGGGACATGCHGGG | ZNF816 | 6.74e-11 | 7.38e-13 | 5070 | 30.5 | 11500 | 27.4 | 1.11 |
70 | 33 | MA1981.1 | ZNF530 | MA1981.1 | GMARGGMRAGGGGCRGSV | ZNF530 | 7.67e-11 | 8.27e-13 | 7510 | 45.2 | 17500 | 41.9 | 1.08 |
71 | 14 | MA1581.1 | ZBTB6 | MA1581.1 | DNSCTTGAGCCNV | ZBTB6 | 7.94e-10 | 8.44e-12 | 7930 | 47.7 | 18600 | 44.5 | 1.07 |
72 | 34 | MA0527.1 | ZBTB33 | MA0527.1 | BTCTCGCGAGABYTS | ZBTB33 | 1.03e-09 | 1.08e-11 | 2600 | 15.6 | 5580 | 13.4 | 1.17 |
73 | 28 | MA1990.1 | Gli1 | MA1990.1 | NNRGACCACCCASV | Gli1 | 1.35e-09 | 1.4e-11 | 7540 | 45.4 | 17600 | 42.2 | 1.08 |
74 | 21 | MA1615.1 | Plagl1 | MA1615.1 | NNCTGGGGCCABN | Plagl1 | 1.62e-09 | 1.66e-11 | 2440 | 14.7 | 5230 | 12.5 | 1.17 |
75 | 12 | MA0733.1 | EGR4 | MA0733.1 | NHMCGCCCACGCAHWT | EGR4 | 2.12e-09 | 2.14e-11 | 2100 | 12.6 | 4430 | 10.6 | 1.19 |
76 | 28 | MA0734.3 | Gli2 | MA0734.3 | NRGACCACCCASV | Gli2 | 2.18e-09 | 2.17e-11 | 5010 | 30.2 | 11400 | 27.3 | 1.11 |
77 | 2 | MA1587.1 | ZNF135 | MA1587.1 | CCTCGACCTCCYRR | ZNF135 | 2.46e-09 | 2.41e-11 | 4000 | 24.1 | 8950 | 21.4 | 1.12 |
78 | 28 | MA0695.1 | ZBTB7C | MA0695.1 | RCGACCACCGAN | ZBTB7C | 1.02e-08 | 9.88e-11 | 11300 | 68.3 | 27300 | 65.4 | 1.04 |
79 | 35 | MA0830.2 | TCF4 | MA0830.2 | NNGCACCTGCCNN | TCF4 | 2.26e-08 | 2.16e-10 | 7660 | 46.2 | 18000 | 43.2 | 1.07 |
80 | 35 | MA0823.1 | HEY1 | MA0823.1 | GRCACGTGCC | HEY1 | 3.19e-08 | 3.01e-10 | 3330 | 20.1 | 7400 | 17.7 | 1.13 |
81 | 21 | MA0470.2 | E2F4 | MA0470.2 | TTTTGGCGCCAWWW | E2F4 | 3.56e-08 | 3.32e-10 | 1660 | 10 | 3460 | 8.29 | 1.21 |
82 | 6 | MA0528.2 | ZNF263 | MA0528.2 | VNGGGGAGGASG | ZNF263 | 4.35e-08 | 4.01e-10 | 2730 | 16.4 | 5960 | 14.3 | 1.15 |
83 | 36 | MA0632.2 | TCFL5 | MA0632.2 | KCACGCGCMC | TCFL5 | 1.19e-07 | 1.08e-09 | 661 | 3.98 | 1220 | 2.93 | 1.36 |
84 | 35 | MA0259.1 | ARNT::HIF1A | MA0259.1 | VBACGTGC | ARNT::HIF1A | 1.21e-07 | 1.09e-09 | 3960 | 23.9 | 8950 | 21.4 | 1.11 |
85 | 11 | MA1569.1 | TFAP2E | MA1569.1 | YGCCYSRGGCD | TFAP2E | 1.53e-07 | 1.36e-09 | 1160 | 6.99 | 2340 | 5.6 | 1.25 |
86 | 35 | MA1966.1 | TFAP4::ETV1 | MA1966.1 | RSCGGAAGCAGSTGKN | TFAP4::ETV1 | 1.89e-07 | 1.66e-09 | 4960 | 29.9 | 11400 | 27.3 | 1.09 |
87 | 12 | MA1656.1 | ZNF449 | MA1656.1 | NNAAGCCCAACCNN | ZNF449 | 2.12e-07 | 1.84e-09 | 7400 | 44.6 | 17400 | 41.8 | 1.07 |
88 | 35 | MA1106.1 | HIF1A | MA1106.1 | SBACGTGCNN | HIF1A | 2.32e-07 | 2e-09 | 6620 | 39.9 | 15500 | 37.1 | 1.07 |
89 | 37 | MA1548.1 | PLAGL2 | MA1548.1 | NGGGCCCCCN | PLAGL2 | 3.48e-07 | 2.95e-09 | 993 | 5.98 | 1970 | 4.72 | 1.27 |
90 | 21 | MA0864.2 | E2F2 | MA0864.2 | RWTTTGGCGCCAWWWY | E2F2 | 4.28e-07 | 3.59e-09 | 1280 | 7.74 | 2640 | 6.31 | 1.23 |
91 | 19 | MA1684.1 | Foxn1 | MA1684.1 | RGAMGC | Foxn1 | 4.35e-07 | 3.61e-09 | 715 | 4.31 | 1360 | 3.25 | 1.33 |
92 | 29 | MA0006.1 | Ahr::Arnt | MA0006.1 | YGCGTG | Ahr::Arnt | 4.78e-07 | 3.93e-09 | 2370 | 14.3 | 5180 | 12.4 | 1.15 |
93 | 38 | MA1973.1 | ZKSCAN3 | MA1973.1 | KDKCCCAGGCTAGCCCABNN | ZKSCAN3 | 5.07e-07 | 4.12e-09 | 3910 | 23.6 | 8870 | 21.2 | 1.11 |
94 | 35 | MA0597.2 | THAP1 | MA0597.2 | VSGCAGGGCASV | THAP1 | 6.89e-07 | 5.54e-09 | 3570 | 21.5 | 8040 | 19.3 | 1.12 |
95 | 39 | MA0511.2 | RUNX2 | MA0511.2 | WAACCGCAA | RUNX2 | 9.25e-07 | 7.36e-09 | 11600 | 70.1 | 28200 | 67.5 | 1.04 |
96 | 35 | MA1945.1 | ETV5::FIGLA | MA1945.1 | RGCGGAARCAGGTGG | ETV5::FIGLA | 1.05e-06 | 8.22e-09 | 6730 | 40.5 | 15800 | 37.9 | 1.07 |
97 | 35 | MA1941.1 | ETV2::FIGLA | MA1941.1 | RSCGGAARCAGSTGBN | ETV2::FIGLA | 1.06e-06 | 8.22e-09 | 9000 | 54.2 | 21500 | 51.5 | 1.05 |
98 | 30 | MA0067.2 | PAX2 | MA0067.2 | BCAVTSRAGCGTGACGR | PAX2 | 1.22e-06 | 9.4e-09 | 1190 | 7.16 | 2430 | 5.82 | 1.23 |
99 | 13 | MA1930.1 | CTCF | MA1930.1 | CTGCAGTKCCNVCHNNYRGCCASYAGRKGGCRAKC | CTCF | 1.44e-06 | 1.1e-08 | 3860 | 23.2 | 8770 | 21 | 1.11 |
100 | 23 | MA1584.1 | ZIC5 | MA1584.1 | VGACCCCCCGCTGHGM | ZIC5 | 2.53e-06 | 1.91e-08 | 2750 | 16.6 | 6120 | 14.6 | 1.13 |
101 | 40 | MA0615.1 | Gmeb1 | MA0615.1 | BHBBKKACGTMMNWNNN | Gmeb1 | 2.62e-06 | 1.96e-08 | 4180 | 25.1 | 9540 | 22.9 | 1.1 |
102 | 35 | MA1631.1 | ASCL1 | MA1631.1 | NNGCACCTGCYNB | ASCL1 | 2.84e-06 | 2.1e-08 | 1990 | 12 | 4300 | 10.3 | 1.16 |
103 | 35 | MA0821.2 | HES5 | MA0821.2 | GRCACGTGYC | HES5 | 2.89e-06 | 2.12e-08 | 10400 | 62.9 | 25200 | 60.3 | 1.04 |
104 | 21 | MA1727.1 | ZNF417 | MA1727.1 | VRBVNTGGGCGCCAM | ZNF417 | 4.81e-06 | 3.49e-08 | 2470 | 14.9 | 5450 | 13.1 | 1.14 |
105 | 41 | MA1964.1 | SMAD2 | MA1964.1 | NBCCAGACDN | SMAD2 | 5.09e-06 | 3.66e-08 | 7550 | 45.4 | 17900 | 42.8 | 1.06 |
106 | 42 | MA1716.1 | ZNF76 | MA1716.1 | NNTWCCCAYAATGCAHYGCGC | ZNF76 | 9.85e-06 | 7.02e-08 | 4730 | 28.5 | 10900 | 26.2 | 1.09 |
107 | 23 | MA0696.1 | ZIC1 | MA0696.1 | GACCCCCYGCTGTG | ZIC1 | 1.13e-05 | 7.98e-08 | 2620 | 15.8 | 5840 | 14 | 1.13 |
108 | 4 | MA1653.1 | ZNF148 | MA1653.1 | YBCCCCTCCCCC | ZNF148 | 1.29e-05 | 9.06e-08 | 385 | 2.32 | 676 | 1.62 | 1.43 |
109 | 43 | MA1982.1 | ZNF574 | MA1982.1 | VGSCTAGAGMGGCCS | ZNF574 | 1.42e-05 | 9.87e-08 | 3120 | 18.8 | 7050 | 16.9 | 1.11 |
110 | 44 | MA1710.1 | ZNF257 | MA1710.1 | VDGAGGCRAGRG | ZNF257 | 1.49e-05 | 1.02e-07 | 10800 | 65.2 | 26200 | 62.8 | 1.04 |
111 | 45 | MA0145.2 | Tfcp2l1 | MA0145.2 | CCAGYYYVADCCRG | Tfcp2l1 | 1.66e-05 | 1.13e-07 | 5190 | 31.2 | 12100 | 28.9 | 1.08 |
112 | 46 | MA0750.2 | ZBTB7A | MA0750.2 | NVCCGGAAGTGSV | ZBTB7A | 1.84e-05 | 1.24e-07 | 1830 | 11 | 3960 | 9.5 | 1.16 |
113 | 47 | MA1599.1 | ZNF682 | MA1599.1 | BVGGCYAAGCCCCWNY | ZNF682 | 3.07e-05 | 2.05e-07 | 11200 | 67.6 | 27300 | 65.3 | 1.04 |
114 | 11 | MA0815.1 | TFAP2C | MA0815.1 | TGCCCYSRGGGCA | TFAP2C | 3.46e-05 | 2.29e-07 | 2410 | 14.5 | 5350 | 12.8 | 1.13 |
115 | 11 | MA0872.1 | TFAP2A | MA0872.1 | TGCCCYSRGGGCA | TFAP2A | 4.15e-05 | 2.73e-07 | 2280 | 13.7 | 5040 | 12.1 | 1.14 |
116 | 48 | MA1972.1 | ZFP14 | MA1972.1 | GGAGGCHCHGGARBG | ZFP14 | 4.7e-05 | 3.06e-07 | 8510 | 51.2 | 20400 | 48.8 | 1.05 |
117 | 49 | MA1542.1 | OSR1 | MA1542.1 | HGCTACYGTD | OSR1 | 7.2e-05 | 4.65e-07 | 12300 | 73.8 | 29900 | 71.7 | 1.03 |
118 | 13 | MA0155.1 | INSM1 | MA0155.1 | TGYCWGGGGGCR | INSM1 | 8.36e-05 | 5.35e-07 | 3800 | 22.9 | 8720 | 20.9 | 1.09 |
119 | 13 | MA1929.1 | CTCF | MA1929.1 | CTGCAGTKCCNNNNNYNRCCASYAGRKGGCRSWG | CTCF | 1e-04 | 6.35e-07 | 2310 | 13.9 | 5130 | 12.3 | 1.13 |
120 | 50 | MA1628.1 | Zic1::Zic2 | MA1628.1 | CVCAGCAGGNV | Zic1::Zic2 | 0.000111 | 6.98e-07 | 3940 | 23.7 | 9080 | 21.8 | 1.09 |
121 | 35 | MA0104.4 | MYCN | MA0104.4 | VVCCACGTGGBB | MYCN | 0.000123 | 7.7e-07 | 10900 | 65.8 | 26500 | 63.5 | 1.04 |
122 | 11 | MA0813.1 | TFAP2B | MA0813.1 | TGCCCYBRGGGCA | TFAP2B | 0.000125 | 7.73e-07 | 2560 | 15.4 | 5740 | 13.8 | 1.12 |
123 | 51 | MA0767.1 | GCM2 | MA0767.1 | BATGCGGGTR | GCM2 | 0.000175 | 1.07e-06 | 3630 | 21.8 | 8330 | 20 | 1.09 |
124 | 35 | MA1464.1 | ARNT2 | MA1464.1 | GTCACGTGMN | ARNT2 | 0.000203 | 1.24e-06 | 6530 | 39.3 | 15500 | 37.1 | 1.06 |
125 | 52 | MA1979.1 | ZNF416 | MA1979.1 | TATCTGGGCAH | ZNF416 | 0.000223 | 1.35e-06 | 8950 | 53.9 | 21500 | 51.6 | 1.04 |
126 | 35 | MA0147.3 | MYC | MA0147.3 | NNCCACGTGCNB | MYC | 0.000242 | 1.45e-06 | 8490 | 51.1 | 20400 | 48.8 | 1.05 |
127 | 35 | MA0697.2 | Zic3 | MA0697.2 | CNCAGCAGGAGNN | Zic3 | 0.000263 | 1.56e-06 | 6140 | 37 | 14500 | 34.8 | 1.06 |
128 | 35 | MA1099.2 | HES1 | MA1099.2 | GGCRCGTGBC | HES1 | 0.000328 | 1.94e-06 | 1110 | 6.69 | 2340 | 5.6 | 1.19 |
129 | 50 | MA1629.1 | Zic2 | MA1629.1 | NDCACAGCAGGDRG | Zic2 | 0.000373 | 2.19e-06 | 10800 | 64.9 | 26200 | 62.7 | 1.03 |
130 | 51 | MA0646.1 | GCM1 | MA0646.1 | BATGCGGGTAC | GCM1 | 0.000406 | 2.36e-06 | 5950 | 35.8 | 14100 | 33.7 | 1.06 |
131 | 35 | MA0649.1 | HEY2 | MA0649.1 | GRCACGTGYC | HEY2 | 0.000978 | 5.64e-06 | 4350 | 26.2 | 10100 | 24.3 | 1.08 |
132 | 35 | MA0616.2 | HES2 | MA0616.2 | GGCACGTGYC | HES2 | 0.00107 | 6.12e-06 | 5770 | 34.8 | 13700 | 32.7 | 1.06 |
133 | 53 | MA1971.1 | ZBED2 | MA1971.1 | NNMMCGAAACCNNV | ZBED2 | 0.00115 | 6.52e-06 | 7600 | 45.8 | 18200 | 43.6 | 1.05 |
134 | 54 | MA1555.1 | RXRB | MA1555.1 | RRGGTCATGACCYY | RXRB | 0.00134 | 7.55e-06 | 4650 | 28 | 10900 | 26.1 | 1.07 |
135 | 55 | MA1474.1 | CREB3L4 | MA1474.1 | YGCCACGTCAYC | CREB3L4 | 0.00305 | 1.71e-05 | 7440 | 44.8 | 17800 | 42.7 | 1.05 |
136 | 56 | MA1557.1 | SMAD5 | MA1557.1 | YGTCTAGACA | SMAD5 | 0.00326 | 1.81e-05 | 4190 | 25.2 | 9790 | 23.5 | 1.08 |
10 | 61 | 10-AGGATTGCTTG | STREME-10 | 10-AGGATTGCTTG | 10-AGGATTGCTTG | AGGATTGCTTG | 0.00337 | 0.000337 | 543 | 3.27 | 1140 | 2.72 | 1.2 |
137 | 46 | MA0759.2 | ELK3 | MA0759.2 | ACCGGAAGTRV | ELK3 | 0.00353 | 1.95e-05 | 1570 | 9.43 | 3450 | 8.27 | 1.14 |
138 | 56 | MA0861.1 | TP73 | MA0861.1 | DRCATGTCNNRACAYGYM | TP73 | 0.0036 | 1.97e-05 | 3640 | 21.9 | 8450 | 20.2 | 1.08 |
139 | 46 | MA0764.3 | ETV4 | MA0764.3 | ACCGGAAGTR | ETV4 | 0.00425 | 2.31e-05 | 5820 | 35 | 13800 | 33.1 | 1.06 |
140 | 35 | MA0603.1 | Arntl | MA0603.1 | NGTCACGTGH | Arntl | 0.00504 | 2.72e-05 | 8870 | 53.4 | 21500 | 51.4 | 1.04 |
141 | 35 | MA1997.1 | Olig2 | MA1997.1 | NVCAGCTGBN | Olig2 | 0.00585 | 3.14e-05 | 5660 | 34.1 | 13400 | 32.2 | 1.06 |
142 | 46 | MA0765.3 | ETV5 | MA0765.3 | ACCGGAAGTR | ETV5 | 0.00642 | 3.42e-05 | 3460 | 20.9 | 8040 | 19.3 | 1.08 |
143 | 57 | MA1528.1 | NFIX | MA1528.1 | STTGGCRNNGTGCCARB | NFIX | 0.00654 | 3.45e-05 | 3670 | 22.1 | 8550 | 20.5 | 1.08 |
144 | 57 | MA0671.1 | NFIX | MA0671.1 | NNTGCCAAN | NFIX | 0.00838 | 4.4e-05 | 16300 | 98.1 | 40700 | 97.5 | 1.01 |
145 | 35 | MA1652.1 | ZKSCAN5 | MA1652.1 | NNRGGARGTGAGRR | ZKSCAN5 | 0.00999 | 5.2e-05 | 8250 | 49.7 | 19900 | 47.8 | 1.04 |
146 | 58 | MA1730.1 | ZNF708 | MA1730.1 | NNRGCTGTRCCTCCY | ZNF708 | 0.0117 | 6.03e-05 | 7550 | 45.5 | 18200 | 43.6 | 1.04 |
147 | 35 | MA1993.1 | Neurod2 | MA1993.1 | VVCAGCTGBB | Neurod2 | 0.0129 | 6.61e-05 | 5540 | 33.4 | 13200 | 31.6 | 1.06 |
148 | 35 | MA0633.2 | Twist2 | MA0633.2 | NVCAGCTGBN | Twist2 | 0.0132 | 6.74e-05 | 5130 | 30.9 | 12200 | 29.1 | 1.06 |
149 | 35 | MA0103.3 | ZEB1 | MA0103.3 | SNCACCTGSVN | ZEB1 | 0.0162 | 8.21e-05 | 14100 | 84.8 | 34800 | 83.4 | 1.02 |
150 | 59 | MA1715.1 | ZNF707 | MA1715.1 | CCCCACTCCTGGTAC | ZNF707 | 0.0195 | 9.83e-05 | 3310 | 19.9 | 7700 | 18.4 | 1.08 |
151 | 60 | GCCTGTAATCCCAGC | MEME-1 | GCCTGTAATCCCAGC | GCCTGTAATCCCAGC | GCCTGTAATCCCAGC | 0.021 | 0.000105 | 280 | 1.69 | 518 | 1.24 | 1.36 |
152 | 35 | MA0822.1 | HES7 | MA0822.1 | YGGCACGTGCCR | HES7 | 0.0212 | 0.000105 | 750 | 4.52 | 1580 | 3.78 | 1.2 |
153 | 60 | MA0712.2 (OTX2) | STREME-7 | 7-CCTGTAATCCCAGC | CCTGTAATCCCAGC | OTX2 | 0.0394 | 0.000195 | 290 | 1.75 | 546 | 1.31 | 1.34 |
11 | 62 | MA1616.1 (Prdm15) | STREME-9 | 9-GAACTCCTGGRCT | GAACTCCTGGRCT | Prdm15 | 0.246 | 0.0224 | 40 | 0.24 | 64 | 0.15 | 1.59 |
12 | 63 | MA1125.1 (ZNF384) | MEME-2 | HWAAAAAAAAAAAAA | HWAAAAAAAAAAAAA | ZNF384 | 0.636 | 0.0516 | 452 | 2.72 | 1030 | 2.47 | 1.1 |
13 | 64 | 11-AGGTTGCAGTGA | STREME-11 | 11-AGGTTGCAGTGA | 11-AGGTTGCAGTGA | AGGTTGCAGTGA | 0.67 | 0.0516 | 34 | 0.2 | 58 | 0.14 | 1.49 |
14 | 65 | MA0478.1 (FOSL2) | MEME-3 | TGASTCAT | TGASTCAT | FOSL2 | 4.32 | 0.309 | 35 | 0.21 | 78 | 0.19 | 1.15 |
DOX_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DOX_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DOX_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*2), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./2,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DOX_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
EPI_3_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_3_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
EPI_3_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_3_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
EPI_3_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_3_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
EPI_3_DAR_DEG_sea_all <- rbind(EPI_3_DAR_DEG_sea_disc, EPI_3_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
EPI_3_DAR_DEG_combo <- EPI_3_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., EPI_3_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
EPI_3_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA0117.2 (Mafb) | STREME-1 | 1-MATGTACTGACTDCA | MATGTACTGACTDCA | Mafb | 1.45e-08 | 1.45e-08 | 11 | 8.46 | 49 | 0.39 | 23.2 |
2 | 2 | 2-AATACAGTTCSA | STREME-2 | 2-AATACAGTTCSA | AATACAGTTCSA | AATACAGTTCSA | 0.0117 | 0.00583 | 9 | 6.92 | 283 | 2.23 | 3.41 |
3 | 3 | 3-CRGGGTTTKAGW | STREME-3 | 3-CRGGGTTTKAGW | CRGGGTTTKAGW | CRGGGTTTKAGW | 0.0977 | 0.0326 | 12 | 9.23 | 611 | 4.82 | 2.06 |
4 | 4 | MA1125.1 (ZNF384) | MEME-1 | WBTTTTTKTTTKTTT | WBTTTTTKTTTKTTT | ZNF384 | 0.142 | 0.0354 | 3 | 2.31 | 68 | 0.54 | 5.61 |
EPI_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("EPI_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
EPI_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*.5), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./.5,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("EPI_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
EPI_24_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_24_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
EPI_24_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_24_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
EPI_24_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/EPI_24_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
EPI_24_DAR_DEG_sea_all <- rbind(EPI_24_DAR_DEG_sea_disc, EPI_24_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
EPI_24_DAR_DEG_combo <- EPI_24_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., EPI_24_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
EPI_24_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA1596.1 (ZNF460) | STREME-3 | 3-CGCCTCCC | CGCCTCCC | ZNF460 | 6.47e-22 | 4.98e-22 | 2400 | 15.2 | 5310 | 12 | 1.27 |
2 | 2 | MA1564.1 | SP9 | MA1564.1 | RCCACGCCCMCY | SP9 | 6.19e-20 | 2.38e-20 | 3170 | 20.2 | 7360 | 16.7 | 1.21 |
3 | 2 | MA1513.1 | KLF15 | MA1513.1 | SCCCCGCCCCS | KLF15 | 1.3e-19 | 3.34e-20 | 2010 | 12.8 | 4410 | 9.99 | 1.28 |
4 | 2 | MA0746.2 | SP3 | MA0746.2 | NGCCACGCCCMCH | SP3 | 4.4e-18 | 8.46e-19 | 3530 | 22.5 | 8380 | 19 | 1.18 |
5 | 3 | MA1713.1 | ZNF610 | MA1713.1 | SSCGCCGCTCCSSS | ZNF610 | 1.06e-17 | 1.45e-18 | 2020 | 12.9 | 4480 | 10.2 | 1.27 |
6 | 20 | MA1653.1 | ZNF148 | MA1653.1 | YBCCCCTCCCCC | ZNF148 | 1.13e-17 | 1.45e-18 | 1870 | 11.9 | 4100 | 9.29 | 1.28 |
7 | 4 | 5-BCSSGV | STREME-5 | 5-BCSSGV | BCSSGV | BCSSGV | 4.6e-17 | 5.05e-18 | 2190 | 13.9 | 4930 | 11.2 | 1.25 |
8 | 5 | MA1596.1 | ZNF460 | MA1596.1 | GCCTCMGCCTCCCRAG | ZNF460 | 5.48e-17 | 5.27e-18 | 2130 | 13.6 | 4780 | 10.8 | 1.25 |
9 | 6 | 1-CCCGGCG | STREME-1 | 1-CCCGGCG | CCCGGCG | CCCGGCG | 1.61e-16 | 1.37e-17 | 2280 | 14.6 | 5190 | 11.8 | 1.24 |
10 | 7 | MA1650.1 | ZBTB14 | MA1650.1 | SSCCGCGCACVS | ZBTB14 | 3.11e-16 | 2.39e-17 | 2430 | 15.5 | 5580 | 12.6 | 1.23 |
11 | 8 | 2-CGCGCG | STREME-2 | 2-CGCGCG | CGCGCG | CGCGCG | 5.66e-16 | 3.95e-17 | 3200 | 20.4 | 7590 | 17.2 | 1.19 |
12 | 2 | MA1511.2 | KLF10 | MA1511.2 | GGGGCGGGG | KLF10 | 8.45e-16 | 5.41e-17 | 2880 | 18.4 | 6770 | 15.3 | 1.2 |
13 | 9 | MA0146.2 | Zfx | MA0146.2 | SSSGCCBVGGCCTS | Zfx | 9.26e-16 | 5.48e-17 | 4720 | 30 | 11600 | 26.4 | 1.14 |
14 | 10 | MA0506.2 | Nrf1 | MA0506.2 | SNCTGCGCMTGCGCV | Nrf1 | 2.91e-15 | 1.6e-16 | 2810 | 17.9 | 6600 | 14.9 | 1.2 |
15 | 2 | MA1961.1 | PATZ1 | MA1961.1 | SSGGGGMGGGGS | PATZ1 | 8.51e-15 | 4.36e-16 | 6730 | 42.9 | 17200 | 39 | 1.1 |
16 | 11 | MA1102.2 | CTCFL | MA1102.2 | NSCAGGGGGCGS | CTCFL | 1.53e-14 | 7.35e-16 | 2430 | 15.5 | 5640 | 12.8 | 1.21 |
17 | 2 | MA0742.2 | KLF12 | MA0742.2 | GGGGCGGGG | KLF12 | 6.3e-14 | 2.85e-15 | 2990 | 19 | 7120 | 16.1 | 1.18 |
18 | 2 | MA0740.2 | KLF14 | MA0740.2 | KGGGCGGGG | KLF14 | 8.44e-14 | 3.6e-15 | 2250 | 14.3 | 5200 | 11.8 | 1.22 |
19 | 12 | MA1976.1 | ZNF320 | MA1976.1 | DGKGGGRCCMRGGGGCCWGTGHVNNNN | ZNF320 | 3e-13 | 1.22e-14 | 4360 | 27.8 | 10800 | 24.5 | 1.13 |
20 | 2 | MA0079.5 | SP1 | MA0079.5 | GGGGCGGGG | SP1 | 7.85e-13 | 3.02e-14 | 3410 | 21.7 | 8280 | 18.8 | 1.16 |
21 | 2 | MA1515.1 | KLF2 | MA1515.1 | NRCCACRCCCH | KLF2 | 1.46e-12 | 5.27e-14 | 2400 | 15.3 | 5640 | 12.8 | 1.2 |
22 | 2 | MA1512.1 | KLF11 | MA1512.1 | SCCACGCCCMC | KLF11 | 1.51e-12 | 5.27e-14 | 3300 | 21 | 8000 | 18.1 | 1.16 |
23 | 13 | MA1583.1 | ZFP57 | MA1583.1 | NNVTTGCCGCANN | ZFP57 | 1.58e-12 | 5.28e-14 | 3540 | 22.5 | 8630 | 19.5 | 1.15 |
24 | 14 | MA1712.1 | ZNF454 | MA1712.1 | DGGCTCSSGGCCCYGVKG | ZNF454 | 1.69e-12 | 5.43e-14 | 2740 | 17.4 | 6520 | 14.8 | 1.18 |
25 | 15 | MA0810.1 | TFAP2A | MA0810.1 | YGCCCBVRGGCR | TFAP2A | 2.14e-12 | 6.59e-14 | 2890 | 18.4 | 6930 | 15.7 | 1.17 |
26 | 2 | MA0162.4 | EGR1 | MA0162.4 | VCMCGCCCACGCVS | EGR1 | 2.32e-12 | 6.87e-14 | 2160 | 13.8 | 5020 | 11.4 | 1.21 |
27 | 13 | MA1122.1 | TFDP1 | MA1122.1 | VSGCGGGAAVN | TFDP1 | 2.79e-12 | 7.95e-14 | 4280 | 27.2 | 10600 | 24.1 | 1.13 |
28 | 16 | MA0823.1 | HEY1 | MA0823.1 | GRCACGTGCC | HEY1 | 4.86e-12 | 1.33e-13 | 5790 | 36.9 | 14800 | 33.5 | 1.1 |
29 | 17 | MA1986.1 | ZNF692 | MA1986.1 | BGGGCCCASN | ZNF692 | 6.4e-12 | 1.7e-13 | 5380 | 34.2 | 13600 | 30.9 | 1.11 |
30 | 13 | MA0471.2 | E2F6 | MA0471.2 | VVDGGCGGGAAVV | E2F6 | 1.29e-11 | 3.31e-13 | 1480 | 9.41 | 3300 | 7.47 | 1.26 |
31 | 2 | MA0747.1 | SP8 | MA0747.1 | RCCACGCCCMCY | SP8 | 3.59e-11 | 8.73e-13 | 3410 | 21.7 | 8360 | 18.9 | 1.15 |
32 | 2 | MA1959.1 | KLF7 | MA1959.1 | DGGGCGGGG | KLF7 | 3.63e-11 | 8.73e-13 | 6670 | 42.5 | 17200 | 39.1 | 1.09 |
33 | 2 | MA1516.1 | KLF3 | MA1516.1 | GRCCRCGCCCH | KLF3 | 4.91e-11 | 1.14e-12 | 5790 | 36.9 | 14800 | 33.6 | 1.1 |
34 | 2 | MA0039.4 | KLF4 | MA0039.4 | NNCCCCACCCHN | KLF4 | 5.03e-11 | 1.14e-12 | 5350 | 34 | 13600 | 30.8 | 1.1 |
35 | 20 | MA1630.2 | ZNF281 | MA1630.2 | GGGGGAGGGGVV | ZNF281 | 6.56e-11 | 1.44e-12 | 2170 | 13.8 | 5110 | 11.6 | 1.2 |
36 | 2 | MA0741.1 | KLF16 | MA0741.1 | GMCACGCCCCC | KLF16 | 7.44e-11 | 1.59e-12 | 6970 | 44.4 | 18100 | 41 | 1.08 |
37 | 2 | MA0753.2 | ZNF740 | MA0753.2 | CYRCCCCCCCCAC | ZNF740 | 8.44e-11 | 1.73e-12 | 3110 | 19.8 | 7580 | 17.2 | 1.15 |
38 | 12 | MA1723.1 | PRDM9 | MA1723.1 | RGDGGGVAGGGRGGVRRMARVARR | PRDM9 | 8.55e-11 | 1.73e-12 | 6750 | 43 | 17500 | 39.6 | 1.08 |
39 | 20 | MA1522.1 | MAZ | MA1522.1 | BBCCCCTCCCY | MAZ | 9.72e-11 | 1.92e-12 | 7070 | 45 | 18400 | 41.6 | 1.08 |
40 | 2 | MA0685.2 | SP4 | MA0685.2 | GGGGCGGGG | SP4 | 1.11e-10 | 2.14e-12 | 4730 | 30.1 | 11900 | 27.1 | 1.11 |
41 | 18 | MA0131.2 | HINFP | MA0131.2 | CARCGTCCGCKN | HINFP | 1.75e-10 | 3.28e-12 | 2230 | 14.2 | 5280 | 12 | 1.19 |
42 | 15 | MA0524.2 | TFAP2C | MA0524.2 | YGCCYBVRGGCA | TFAP2C | 2.82e-10 | 5.16e-12 | 6920 | 44.1 | 18000 | 40.8 | 1.08 |
43 | 19 | MA0814.2 | TFAP2C | MA0814.2 | NNGCCTGAGGCGNN | TFAP2C | 3.36e-10 | 6.01e-12 | 7760 | 49.4 | 20300 | 46.1 | 1.07 |
44 | 20 | MA1653.1 (ZNF148) | STREME-6 | 6-CCTCCCC | CCTCCCC | ZNF148 | 5.46e-10 | 9.5e-12 | 9600 | 61.2 | 25600 | 57.9 | 1.06 |
45 | 2 | MA0733.1 | EGR4 | MA0733.1 | NHMCGCCCACGCAHWT | EGR4 | 5.56e-10 | 9.5e-12 | 5960 | 37.9 | 15400 | 34.8 | 1.09 |
46 | 21 | MA1548.1 | PLAGL2 | MA1548.1 | NGGGCCCCCN | PLAGL2 | 5.82e-10 | 9.74e-12 | 3710 | 23.6 | 9230 | 20.9 | 1.13 |
47 | 2 | MA0599.1 | KLF5 | MA0599.1 | GCCCCDCCCH | KLF5 | 7.54e-10 | 1.23e-11 | 2860 | 18.2 | 6960 | 15.8 | 1.16 |
48 | 22 | MA1721.1 | ZNF93 | MA1721.1 | GGYAGCRGCAGCGGYG | ZNF93 | 7.78e-10 | 1.25e-11 | 977 | 6.22 | 2100 | 4.75 | 1.31 |
49 | 2 | MA0516.3 | SP2 | MA0516.3 | GGGGCGGGG | SP2 | 1.16e-09 | 1.82e-11 | 8880 | 56.6 | 23500 | 53.3 | 1.06 |
50 | 2 | MA1514.1 | KLF17 | MA1514.1 | MMCCACGCACCCMTY | KLF17 | 1.51e-09 | 2.32e-11 | 6340 | 40.4 | 16400 | 37.2 | 1.08 |
51 | 15 | MA0811.1 | TFAP2B | MA0811.1 | YGCCCBVRGGCA | TFAP2B | 1.84e-09 | 2.77e-11 | 7260 | 46.3 | 19000 | 43.1 | 1.07 |
52 | 5 | MA1587.1 | ZNF135 | MA1587.1 | CCTCGACCTCCYRR | ZNF135 | 3.88e-09 | 5.74e-11 | 5990 | 38.1 | 15500 | 35.1 | 1.09 |
53 | 15 | MA0812.1 | TFAP2B | MA0812.1 | HGCCTSAGGCD | TFAP2B | 4.44e-09 | 6.44e-11 | 6550 | 41.7 | 17000 | 38.6 | 1.08 |
54 | 23 | MA0865.2 | E2F8 | MA0865.2 | TTCCCGCCAHWA | E2F8 | 9.94e-09 | 1.42e-10 | 4500 | 28.7 | 11400 | 25.9 | 1.11 |
55 | 13 | MA1615.1 | Plagl1 | MA1615.1 | NNCTGGGGCCABN | Plagl1 | 2.27e-08 | 3.18e-10 | 12200 | 77.5 | 33100 | 74.9 | 1.03 |
56 | 2 | MA0493.2 | KLF1 | MA0493.2 | DGGGYGGGG | KLF1 | 3.41e-08 | 4.68e-10 | 1130 | 7.22 | 2530 | 5.74 | 1.26 |
57 | 24 | MA0751.1 | ZIC4 | MA0751.1 | GRCCCCCCGCKGYGH | ZIC4 | 4.38e-08 | 5.8e-10 | 1060 | 6.76 | 2360 | 5.34 | 1.27 |
58 | 23 | MA0527.1 | ZBTB33 | MA0527.1 | BTCTCGCGAGABYTS | ZBTB33 | 4.4e-08 | 5.8e-10 | 2720 | 17.3 | 6680 | 15.1 | 1.15 |
59 | 25 | MA1542.1 | OSR1 | MA1542.1 | HGCTACYGTD | OSR1 | 4.45e-08 | 5.8e-10 | 10500 | 66.6 | 28200 | 63.8 | 1.04 |
60 | 23 | MA0024.3 | E2F1 | MA0024.3 | TTTGGCGCCAAA | E2F1 | 5.81e-08 | 7.45e-10 | 1970 | 12.6 | 4710 | 10.7 | 1.18 |
61 | 26 | MA1981.1 | ZNF530 | MA1981.1 | GMARGGMRAGGGGCRGSV | ZNF530 | 6.15e-08 | 7.75e-10 | 7640 | 48.6 | 20200 | 45.7 | 1.07 |
62 | 27 | MA0694.1 | ZBTB7B | MA0694.1 | RCGACCACCGAA | ZBTB7B | 7.21e-08 | 8.95e-10 | 6660 | 42.4 | 17400 | 39.5 | 1.07 |
63 | 23 | MA0470.2 | E2F4 | MA0470.2 | TTTTGGCGCCAWWW | E2F4 | 7.34e-08 | 8.97e-10 | 1770 | 11.3 | 4180 | 9.47 | 1.19 |
64 | 28 | MA1985.1 | ZNF669 | MA1985.1 | GGGGYGAYGACCRCT | ZNF669 | 1.91e-07 | 2.3e-09 | 5500 | 35 | 14200 | 32.3 | 1.08 |
65 | 23 | MA0748.2 | YY2 | MA0748.2 | NVATGGCGGCS | YY2 | 2.75e-07 | 3.26e-09 | 4840 | 30.8 | 12500 | 28.2 | 1.09 |
66 | 29 | MA0759.2 | ELK3 | MA0759.2 | ACCGGAAGTRV | ELK3 | 5.47e-07 | 6.37e-09 | 2870 | 18.3 | 7130 | 16.2 | 1.13 |
67 | 2 | MA1517.1 | KLF6 | MA1517.1 | NRCCACGCCCH | KLF6 | 5.61e-07 | 6.44e-09 | 7610 | 48.5 | 20200 | 45.6 | 1.06 |
68 | 16 | MA0616.2 | HES2 | MA0616.2 | GGCACGTGYC | HES2 | 7.01e-07 | 7.92e-09 | 3070 | 19.6 | 7680 | 17.4 | 1.12 |
69 | 2 | MA1107.2 | KLF9 | MA1107.2 | MNGCCACACCCACHYV | KLF9 | 8.19e-07 | 9.13e-09 | 6070 | 38.6 | 15900 | 36 | 1.08 |
70 | 16 | MA0821.2 | HES5 | MA0821.2 | GRCACGTGYC | HES5 | 8.75e-07 | 9.61e-09 | 3750 | 23.9 | 9520 | 21.6 | 1.11 |
71 | 29 | MA0750.2 | ZBTB7A | MA0750.2 | NVCCGGAAGTGSV | ZBTB7A | 1.11e-06 | 1.21e-08 | 5900 | 37.6 | 15400 | 34.9 | 1.08 |
72 | 2 | MA0472.2 | EGR2 | MA0472.2 | MCGCCCACGCA | EGR2 | 1.21e-06 | 1.29e-08 | 4050 | 25.8 | 10400 | 23.4 | 1.1 |
73 | 30 | MA0597.2 | THAP1 | MA0597.2 | VSGCAGGGCASV | THAP1 | 1.44e-06 | 1.52e-08 | 5760 | 36.7 | 15000 | 34 | 1.08 |
74 | 31 | MA1599.1 | ZNF682 | MA1599.1 | BVGGCYAAGCCCCWNY | ZNF682 | 1.79e-06 | 1.86e-08 | 2150 | 13.7 | 5240 | 11.9 | 1.15 |
75 | 32 | MA1727.1 | ZNF417 | MA1727.1 | VRBVNTGGGCGCCAM | ZNF417 | 2.28e-06 | 2.34e-08 | 3030 | 19.3 | 7590 | 17.2 | 1.12 |
76 | 33 | MA1715.1 | ZNF707 | MA1715.1 | CCCCACTCCTGGTAC | ZNF707 | 2.77e-06 | 2.81e-08 | 6070 | 38.6 | 15900 | 36 | 1.07 |
77 | 34 | MA1941.1 | ETV2::FIGLA | MA1941.1 | RSCGGAARCAGSTGBN | ETV2::FIGLA | 3.58e-06 | 3.58e-08 | 5220 | 33.3 | 13600 | 30.8 | 1.08 |
78 | 27 | MA1990.1 | Gli1 | MA1990.1 | NNRGACCACCCASV | Gli1 | 4.58e-06 | 4.52e-08 | 7840 | 49.9 | 20900 | 47.3 | 1.06 |
79 | 16 | MA1099.2 | HES1 | MA1099.2 | GGCRCGTGBC | HES1 | 5.04e-06 | 4.91e-08 | 2410 | 15.4 | 5960 | 13.5 | 1.14 |
80 | 24 | MA1584.1 | ZIC5 | MA1584.1 | VGACCCCCCGCTGHGM | ZIC5 | 5.79e-06 | 5.56e-08 | 1320 | 8.39 | 3090 | 6.99 | 1.2 |
81 | 2 | MA1578.1 | VEZF1 | MA1578.1 | CCCCCCMYDH | VEZF1 | 7.7e-06 | 7.31e-08 | 4690 | 29.8 | 12100 | 27.5 | 1.09 |
82 | 35 | MA0067.2 | PAX2 | MA0067.2 | BCAVTSRAGCGTGACGR | PAX2 | 7.83e-06 | 7.34e-08 | 4530 | 28.8 | 11700 | 26.5 | 1.09 |
83 | 16 | MA1464.1 | ARNT2 | MA1464.1 | GTCACGTGMN | ARNT2 | 1.2e-05 | 1.11e-07 | 3750 | 23.9 | 9590 | 21.7 | 1.1 |
84 | 2 | MA0732.1 | EGR3 | MA0732.1 | HHMCGCCCACGCAHH | EGR3 | 1.57e-05 | 1.44e-07 | 1400 | 8.93 | 3320 | 7.53 | 1.19 |
85 | 20 | MA1965.1 | SP5 | MA1965.1 | CBCCTCCCHS | SP5 | 1.62e-05 | 1.46e-07 | 1740 | 11.1 | 4210 | 9.54 | 1.16 |
86 | 35 | MA0014.3 | PAX5 | MA0014.3 | RRGCGTGACCNN | PAX5 | 2.3e-05 | 2.06e-07 | 3270 | 20.8 | 8300 | 18.8 | 1.11 |
87 | 16 | MA0522.3 | TCF3 | MA0522.3 | NVCACCTGCNN | TCF3 | 2.65e-05 | 2.34e-07 | 12000 | 76.7 | 32900 | 74.5 | 1.03 |
88 | 36 | MA1719.1 | ZNF816 | MA1719.1 | DRRKGGGGACATGCHGGG | ZNF816 | 2.73e-05 | 2.39e-07 | 2550 | 16.3 | 6380 | 14.5 | 1.13 |
89 | 27 | MA0734.3 | Gli2 | MA0734.3 | NRGACCACCCASV | Gli2 | 2.84e-05 | 2.45e-07 | 6540 | 41.6 | 17300 | 39.2 | 1.06 |
90 | 37 | MA1982.1 | ZNF574 | MA1982.1 | VGSCTAGAGMGGCCS | ZNF574 | 4.64e-05 | 3.97e-07 | 5320 | 33.9 | 13900 | 31.6 | 1.07 |
91 | 11 | MA0155.1 | INSM1 | MA0155.1 | TGYCWGGGGGCR | INSM1 | 5.27e-05 | 4.45e-07 | 3900 | 24.8 | 10000 | 22.7 | 1.09 |
92 | 24 | MA0696.1 | ZIC1 | MA0696.1 | GACCCCCYGCTGTG | ZIC1 | 7.19e-05 | 6.01e-07 | 2190 | 13.9 | 5420 | 12.3 | 1.13 |
93 | 34 | MA1966.1 | TFAP4::ETV1 | MA1966.1 | RSCGGAAGCAGSTGKN | TFAP4::ETV1 | 7.5e-05 | 6.2e-07 | 2360 | 15 | 5890 | 13.3 | 1.13 |
94 | 20 | MA0528.2 | ZNF263 | MA0528.2 | VNGGGGAGGASG | ZNF263 | 8.3e-05 | 6.79e-07 | 2400 | 15.3 | 6000 | 13.6 | 1.13 |
95 | 19 | MA0003.4 | TFAP2A | MA0003.4 | NNKGCCTCAGGCNN | TFAP2A | 8.4e-05 | 6.8e-07 | 9460 | 60.2 | 25600 | 57.9 | 1.04 |
96 | 34 | MA1945.1 | ETV5::FIGLA | MA1945.1 | RGCGGAARCAGGTGG | ETV5::FIGLA | 8.77e-05 | 7.02e-07 | 6470 | 41.2 | 17100 | 38.8 | 1.06 |
97 | 15 | MA0813.1 | TFAP2B | MA0813.1 | TGCCCYBRGGGCA | TFAP2B | 9.11e-05 | 7.22e-07 | 2910 | 18.5 | 7370 | 16.7 | 1.11 |
98 | 16 | MA1106.1 | HIF1A | MA1106.1 | SBACGTGCNN | HIF1A | 9.59e-05 | 7.52e-07 | 7530 | 48 | 20100 | 45.6 | 1.05 |
99 | 16 | MA0464.2 | BHLHE40 | MA0464.2 | VKCACGTGMC | BHLHE40 | 0.000107 | 8.31e-07 | 6800 | 43.3 | 18100 | 40.9 | 1.06 |
100 | 15 | MA0872.1 | TFAP2A | MA0872.1 | TGCCCYSRGGGCA | TFAP2A | 0.000123 | 9.48e-07 | 2250 | 14.3 | 5600 | 12.7 | 1.13 |
101 | 27 | MA0695.1 | ZBTB7C | MA0695.1 | RCGACCACCGAN | ZBTB7C | 0.000132 | 1e-06 | 5280 | 33.6 | 13900 | 31.4 | 1.07 |
102 | 38 | MA1656.1 | ZNF449 | MA1656.1 | NNAAGCCCAACCNN | ZNF449 | 0.000149 | 1.12e-06 | 12400 | 79.1 | 34100 | 77.2 | 1.03 |
103 | 39 | MA0052.4 (MEF2A) | STREME-7 | 7-TACTAAAAATACA | TACTAAAAATACA | MEF2A | 0.000218 | 1.63e-06 | 130 | 0.83 | 204 | 0.46 | 1.8 |
104 | 16 | MA0830.2 | TCF4 | MA0830.2 | NNGCACCTGCCNN | TCF4 | 0.000268 | 1.98e-06 | 7020 | 44.7 | 18700 | 42.4 | 1.05 |
105 | 15 | MA0815.1 | TFAP2C | MA0815.1 | TGCCCYSRGGGCA | TFAP2C | 0.000284 | 2.08e-06 | 2650 | 16.9 | 6700 | 15.2 | 1.11 |
106 | 19 | MA1581.1 | ZBTB6 | MA1581.1 | DNSCTTGAGCCNV | ZBTB6 | 0.000444 | 3.22e-06 | 10900 | 69.3 | 29700 | 67.2 | 1.03 |
107 | 16 | MA0632.2 | TCFL5 | MA0632.2 | KCACGCGCMC | TCFL5 | 0.000519 | 3.73e-06 | 4160 | 26.5 | 10800 | 24.5 | 1.08 |
108 | 11 | MA1930.1 | CTCF | MA1930.1 | CTGCAGTKCCNVCHNNYRGCCASYAGRKGGCRAKC | CTCF | 0.00055 | 3.92e-06 | 3960 | 25.2 | 10300 | 23.3 | 1.08 |
109 | 16 | MA1648.1 | TCF12 | MA1648.1 | NNCACCTGCNN | TCF12 | 0.000573 | 4.04e-06 | 5300 | 33.8 | 14000 | 31.7 | 1.07 |
110 | 16 | MA0649.1 | HEY2 | MA0649.1 | GRCACGTGYC | HEY2 | 0.000632 | 4.42e-06 | 5100 | 32.5 | 13400 | 30.4 | 1.07 |
111 | 16 | MA0259.1 | ARNT::HIF1A | MA0259.1 | VBACGTGC | ARNT::HIF1A | 0.000669 | 4.64e-06 | 4060 | 25.9 | 10600 | 24 | 1.08 |
112 | 29 | MA0765.3 | ETV5 | MA0765.3 | ACCGGAAGTR | ETV5 | 0.000937 | 6.43e-06 | 3500 | 22.3 | 9060 | 20.5 | 1.09 |
113 | 20 | MA1627.1 | Wt1 | MA1627.1 | YBCCTCCCCCACVB | Wt1 | 0.00122 | 8.31e-06 | 1170 | 7.48 | 2820 | 6.38 | 1.17 |
114 | 40 | MA1979.1 | ZNF416 | MA1979.1 | TATCTGGGCAH | ZNF416 | 0.00151 | 1.02e-05 | 10700 | 68.1 | 29200 | 66 | 1.03 |
115 | 41 | MA1149.1 | RARA::RXRG | MA1149.1 | RRGGTCAHNNNRRGGTCA | RARA::RXRG | 0.00175 | 1.17e-05 | 8400 | 53.5 | 22700 | 51.3 | 1.04 |
116 | 16 | MA1631.1 | ASCL1 | MA1631.1 | NNGCACCTGCYNB | ASCL1 | 0.00204 | 1.35e-05 | 7620 | 48.6 | 20500 | 46.4 | 1.05 |
117 | 29 | MA0764.3 | ETV4 | MA0764.3 | ACCGGAAGTR | ETV4 | 0.00222 | 1.46e-05 | 5820 | 37.1 | 15500 | 35.1 | 1.06 |
118 | 12 | MA0163.1 | PLAG1 | MA0163.1 | GGGGCCCWAGGGGG | PLAG1 | 0.00228 | 1.49e-05 | 1220 | 7.78 | 2950 | 6.68 | 1.16 |
119 | 42 | MA1684.1 | Foxn1 | MA1684.1 | RGAMGC | Foxn1 | 0.00287 | 1.86e-05 | 1330 | 8.5 | 3250 | 7.37 | 1.15 |
120 | 43 | MA2002.1 | Zfp335 | MA2002.1 | VNTCAGGCANH | Zfp335 | 0.00296 | 1.89e-05 | 6670 | 42.5 | 17800 | 40.4 | 1.05 |
121 | 44 | MA0511.2 | RUNX2 | MA0511.2 | WAACCGCAA | RUNX2 | 0.00299 | 1.9e-05 | 10300 | 65.9 | 28200 | 63.9 | 1.03 |
122 | 11 | MA1929.1 | CTCF | MA1929.1 | CTGCAGTKCCNNNNNYNRCCASYAGRKGGCRSWG | CTCF | 0.0038 | 2.4e-05 | 1450 | 9.25 | 3570 | 8.09 | 1.14 |
123 | 34 | MA0633.2 | Twist2 | MA0633.2 | NVCAGCTGBN | Twist2 | 0.0045 | 2.81e-05 | 4470 | 28.5 | 11800 | 26.7 | 1.07 |
124 | 34 | MA1997.1 | Olig2 | MA1997.1 | NVCAGCTGBN | Olig2 | 0.00502 | 3.11e-05 | 4750 | 30.2 | 12500 | 28.4 | 1.07 |
125 | 45 | MA0145.2 | Tfcp2l1 | MA0145.2 | CCAGYYYVADCCRG | Tfcp2l1 | 0.00687 | 4.22e-05 | 14000 | 89.4 | 38900 | 88.2 | 1.01 |
126 | 46 | MA0767.1 | GCM2 | MA0767.1 | BATGCGGGTR | GCM2 | 0.00736 | 4.49e-05 | 4720 | 30.1 | 12500 | 28.3 | 1.06 |
127 | 47 | MA1972.1 | ZFP14 | MA1972.1 | GGAGGCHCHGGARBG | ZFP14 | 0.00763 | 4.62e-05 | 10300 | 65.5 | 28100 | 63.6 | 1.03 |
128 | 34 | MA1993.1 | Neurod2 | MA1993.1 | VVCAGCTGBB | Neurod2 | 0.00793 | 4.76e-05 | 4960 | 31.6 | 13100 | 29.7 | 1.06 |
129 | 48 | MA1475.1 | CREB3L4 | MA1475.1 | GRTGACGTCAYC | CREB3L4 | 0.0124 | 7.39e-05 | 3470 | 22.1 | 9050 | 20.5 | 1.08 |
7 | 52 | MA0509.3 (RFX1) | STREME-4 | 4-MTGTTGCCCAGGC | MTGTTGCCCAGGC | RFX1 | 0.0152 | 0.00217 | 7420 | 47.3 | 20300 | 45.9 | 1.03 |
130 | 41 | MA0159.1 | RARA::RXRA | MA0159.1 | RGKTCANVGRSAGGTCA | RARA::RXRA | 0.0159 | 9.39e-05 | 5390 | 34.3 | 14400 | 32.5 | 1.06 |
131 | 49 | MA1555.1 | RXRB | MA1555.1 | RRGGTCATGACCYY | RXRB | 0.0172 | 0.000101 | 4600 | 29.3 | 12200 | 27.6 | 1.06 |
132 | 50 | MA1710.1 | ZNF257 | MA1710.1 | VDGAGGCRAGRG | ZNF257 | 0.0245 | 0.000143 | 3290 | 20.9 | 8580 | 19.4 | 1.08 |
133 | 29 | MA0028.2 | ELK1 | MA0028.2 | ACCGGAAGTR | ELK1 | 0.0255 | 0.000148 | 3380 | 21.5 | 8820 | 20 | 1.08 |
134 | 16 | MA1560.1 | SOHLH2 | MA1560.1 | YGCACGTGCN | SOHLH2 | 0.0259 | 0.000149 | 11000 | 70 | 30200 | 68.3 | 1.03 |
135 | 51 | MA1964.1 | SMAD2 | MA1964.1 | NBCCAGACDN | SMAD2 | 0.0328 | 0.000187 | 4030 | 25.7 | 10600 | 24.1 | 1.07 |
136 | 16 | MA1934.1 | ERF::FIGLA | MA1934.1 | RCCGGAARCASCTGBN | ERF::FIGLA | 0.0334 | 0.000189 | 2880 | 18.3 | 7470 | 16.9 | 1.08 |
137 | 48 | MA0615.1 | Gmeb1 | MA0615.1 | BHBBKKACGTMMNWNNN | Gmeb1 | 0.0371 | 0.000208 | 6950 | 44.3 | 18700 | 42.5 | 1.04 |
8 | 53 | 8-CAGTGAGCTGWG | STREME-8 | 8-CAGTGAGCTGWG | CAGTGAGCTGWG | CAGTGAGCTGWG | 1.58 | 0.197 | 2060 | 13.1 | 5660 | 12.8 | 1.03 |
9 | 54 | MA0476.1 (FOS) | MEME-2 | TGASTCAT | TGASTCAT | FOS | 7.26 | 0.806 | 154 | 0.98 | 456 | 1.03 | 0.954 |
10 | 39 | MA1125.1 (ZNF384) | MEME-1 | AAAAAAAAAAAAAAA | AAAAAAAAAAAAAAA | ZNF384 | 8.44 | 0.844 | 1850 | 11.8 | 5330 | 12.1 | 0.975 |
EPI_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("EPI_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
EPI_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("EPI_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DNR_3_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_3_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
DNR_3_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_3_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
DNR_3_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_3_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
DNR_3_DAR_DEG_sea_all <- rbind(DNR_3_DAR_DEG_sea_disc, DNR_3_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
DNR_3_DAR_DEG_combo <- DNR_3_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., DNR_3_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
DNR_3_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA1721.1 (ZNF93) | STREME-4 | 4-GCCGGGGCCGC | GCCGGGGCCGC | ZNF93 | 3.14e-12 | 2.85e-12 | 113 | 19.8 | 1770 | 8.92 | 2.23 |
2 | 2 | 6-CCGRAGC | STREME-6 | 6-CCGRAGC | CCGRAGC | CCGRAGC | 9.07e-09 | 4.12e-09 | 308 | 53.8 | 7890 | 39.7 | 1.36 |
3 | 3 | MA1653.1 (ZNF148) | STREME-7 | 7-GCCCCCTCC | GCCCCCTCC | ZNF148 | 2.73e-08 | 8.26e-09 | 266 | 46.5 | 6570 | 33 | 1.41 |
4 | 4 | 5-CGDGACGHGV | STREME-5 | 5-CGDGACGHGV | CGDGACGHGV | CGDGACGHGV | 2.52e-06 | 5.72e-07 | 259 | 45.3 | 6620 | 33.3 | 1.36 |
5 | 5 | MA1721.1 (ZNF93) | MEME-2 | SSCGCCGCCGCCGCC | SSCGCCGCCGCCGCC | ZNF93 | 6.23e-06 | 1.13e-06 | 297 | 51.9 | 7940 | 39.9 | 1.3 |
6 | 6 | MA0742.2 (KLF12) | STREME-2 | 2-CCGCCC | CCGCCC | KLF12 | 8.52e-06 | 1.29e-06 | 223 | 39 | 5540 | 27.9 | 1.4 |
7 | 20 | MA0162.4 | EGR1 | MA0162.4 | VCMCGCCCACGCVS | EGR1 | 3.18e-05 | 4.13e-06 | 394 | 68.9 | 11500 | 57.7 | 1.19 |
8 | 7 | MA0685.2 | SP4 | MA0685.2 | GGGGCGGGG | SP4 | 0.000297 | 3.37e-05 | 400 | 69.9 | 11900 | 59.7 | 1.17 |
9 | 8 | MA1713.1 | ZNF610 | MA1713.1 | SSCGCCGCTCCSSS | ZNF610 | 0.000413 | 4.17e-05 | 393 | 68.7 | 11600 | 58.5 | 1.17 |
10 | 7 | MA1965.1 | SP5 | MA1965.1 | CBCCTCCCHS | SP5 | 0.000692 | 6.29e-05 | 389 | 68 | 11500 | 58 | 1.17 |
11 | 9 | MA1583.1 | ZFP57 | MA1583.1 | NNVTTGCCGCANN | ZFP57 | 0.000948 | 7.83e-05 | 213 | 37.2 | 5550 | 27.9 | 1.34 |
7 | 16 | 1-CCATCCCG | STREME-1 | 1-CCATCCCG | CCATCCCG | CCATCCCG | 0.00111 | 0.000159 | 178 | 31.1 | 4790 | 24.1 | 1.3 |
12 | 6 | MA1513.1 | KLF15 | MA1513.1 | SCCCCGCCCCS | KLF15 | 0.00132 | 9.97e-05 | 241 | 42.1 | 6480 | 32.6 | 1.3 |
13 | 7 | MA1959.1 | KLF7 | MA1959.1 | DGGGCGGGG | KLF7 | 0.00158 | 0.000111 | 328 | 57.3 | 9440 | 47.4 | 1.21 |
8 | 17 | MA1125.1 (ZNF384) | MEME-1 | TTTTTTTTTTTTTWT | TTTTTTTTTTTTTWT | ZNF384 | 0.00232 | 0.000289 | 23 | 4.02 | 336 | 1.69 | 2.47 |
14 | 7 | MA0079.5 | SP1 | MA0079.5 | GGGGCGGGG | SP1 | 0.00248 | 0.000161 | 340 | 59.4 | 9900 | 49.8 | 1.2 |
15 | 10 | MA0024.3 | E2F1 | MA0024.3 | TTTGGCGCCAAA | E2F1 | 0.00794 | 0.000481 | 223 | 39 | 6040 | 30.4 | 1.29 |
16 | 6 | MA0742.2 | KLF12 | MA0742.2 | GGGGCGGGG | KLF12 | 0.00986 | 0.00056 | 315 | 55.1 | 9150 | 46 | 1.2 |
17 | 11 | MA0131.2 | HINFP | MA0131.2 | CARCGTCCGCKN | HINFP | 0.0115 | 0.000613 | 269 | 47 | 7600 | 38.2 | 1.23 |
18 | 9 | MA1122.1 | TFDP1 | MA1122.1 | VSGCGGGAAVN | TFDP1 | 0.0128 | 0.000614 | 267 | 46.7 | 7540 | 37.9 | 1.23 |
19 | 12 | MA0506.2 | Nrf1 | MA0506.2 | SNCTGCGCMTGCGCV | Nrf1 | 0.0129 | 0.000614 | 351 | 61.4 | 10400 | 52.5 | 1.17 |
20 | 7 | MA1961.1 | PATZ1 | MA1961.1 | SSGGGGMGGGGS | PATZ1 | 0.0231 | 0.001 | 361 | 63.1 | 10900 | 54.6 | 1.16 |
21 | 3 | MA1522.1 | MAZ | MA1522.1 | BBCCCCTCCCY | MAZ | 0.0231 | 0.001 | 352 | 61.5 | 10500 | 53 | 1.16 |
22 | 13 | MA0527.1 | ZBTB33 | MA0527.1 | BTCTCGCGAGABYTS | ZBTB33 | 0.0258 | 0.00107 | 121 | 21.2 | 2930 | 14.7 | 1.45 |
23 | 14 | MA0116.1 | Znf423 | MA0116.1 | GSMMCCYARGGKKBM | Znf423 | 0.0294 | 0.00114 | 18 | 3.15 | 196 | 0.99 | 3.35 |
24 | 15 | MA1656.1 | ZNF449 | MA1656.1 | NNAAGCCCAACCNN | ZNF449 | 0.0301 | 0.00114 | 521 | 91.1 | 17000 | 85.4 | 1.07 |
9 | 18 | 3-AGAAGGCG | STREME-3 | 3-AGAAGGCG | AGAAGGCG | AGAAGGCG | 0.0399 | 0.00444 | 449 | 78.5 | 14600 | 73.5 | 1.07 |
25 | 3 | MA1653.1 | ZNF148 | MA1653.1 | YBCCCCTCCCCC | ZNF148 | 0.0427 | 0.00155 | 281 | 49.1 | 8130 | 40.9 | 1.2 |
10 | 19 | MA0814.2 (TFAP2C) | STREME-8 | 8-ACCTGAGCCGGGC | ACCTGAGCCGGGC | TFAP2C | 0.108 | 0.0108 | 136 | 23.8 | 3910 | 19.7 | 1.22 |
11 | 20 | MA1107.2 (KLF9) | MEME-3 | ACACACACRCACACA | ACACACACRCACACA | KLF9 | 11 | 1 | 0 | 0 | 0 | 0 | 34.7 |
DNR_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DNR_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DNR_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DNR_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DNR_24_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_24_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
DNR_24_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_24_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
DNR_24_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/DNR_24_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
DNR_24_DAR_DEG_sea_all <- rbind(DNR_24_DAR_DEG_sea_disc, DNR_24_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
DNR_24_DAR_DEG_combo <- DNR_24_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., DNR_24_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
DNR_24_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | 2-SGCSNSGCS | STREME-2 | 2-SGCSNSGCS | SGCSNSGCS | SGCSNSGCS | 2.46e-45 | 1.79e-45 | 6880 | 31.9 | 13400 | 26.6 | 1.2 |
2 | 2 | 7-CCGGCC | STREME-7 | 7-CCGGCC | CCGGCC | CCGGCC | 1.43e-42 | 5.2e-43 | 4730 | 22 | 8800 | 17.4 | 1.26 |
3 | 3 | MA1529.1 (NHLH2) | STREME-9 | 9-CGCCCC | CGCCCC | NHLH2 | 1.65e-37 | 4e-38 | 7510 | 34.9 | 15100 | 29.8 | 1.17 |
4 | 3 | MA0742.2 (KLF12) | STREME-3 | 3-CCCGCCCCCA | CCCGCCCCCA | KLF12 | 4.26e-37 | 7.75e-38 | 3430 | 16 | 6180 | 12.2 | 1.3 |
5 | 4 | MA1513.1 | KLF15 | MA1513.1 | SCCCCGCCCCS | KLF15 | 6.03e-35 | 8.77e-36 | 3380 | 15.7 | 6120 | 12.1 | 1.3 |
6 | 30 | MA1596.1 | ZNF460 | MA1596.1 | GCCTCMGCCTCCCRAG | ZNF460 | 1.3e-34 | 1.58e-35 | 7740 | 35.9 | 15700 | 31 | 1.16 |
7 | 4 | MA1713.1 | ZNF610 | MA1713.1 | SSCGCCGCTCCSSS | ZNF610 | 4.91e-32 | 5.1e-33 | 3910 | 18.2 | 7320 | 14.5 | 1.25 |
8 | 4 | MA1522.1 | MAZ | MA1522.1 | BBCCCCTCCCY | MAZ | 9.28e-31 | 8.44e-32 | 5490 | 25.5 | 10800 | 21.4 | 1.19 |
9 | 4 | MA1961.1 | PATZ1 | MA1961.1 | SSGGGGMGGGGS | PATZ1 | 9.5e-30 | 7.67e-31 | 4750 | 22.1 | 9190 | 18.2 | 1.21 |
10 | 5 | MA1650.1 | ZBTB14 | MA1650.1 | SSCCGCGCACVS | ZBTB14 | 1.67e-29 | 1.21e-30 | 4140 | 19.2 | 7880 | 15.6 | 1.23 |
11 | 6 | MA0146.2 | Zfx | MA0146.2 | SSSGCCBVGGCCTS | Zfx | 2.91e-28 | 1.92e-29 | 7970 | 37 | 16400 | 32.6 | 1.14 |
12 | 7 | MA1122.1 | TFDP1 | MA1122.1 | VSGCGGGAAVN | TFDP1 | 2.34e-26 | 1.42e-27 | 5090 | 23.6 | 10000 | 19.9 | 1.19 |
13 | 4 | MA0471.2 | E2F6 | MA0471.2 | VVDGGCGGGAAVV | E2F6 | 2.82e-26 | 1.58e-27 | 9240 | 42.9 | 19400 | 38.5 | 1.12 |
14 | 4 | MA0079.5 | SP1 | MA0079.5 | GGGGCGGGG | SP1 | 3.93e-26 | 2.04e-27 | 4600 | 21.4 | 8980 | 17.8 | 1.2 |
15 | 8 | 1-MCGSASS | STREME-1 | 1-MCGSASS | MCGSASS | MCGSASS | 1.09e-25 | 5.31e-27 | 5260 | 24.4 | 10400 | 20.7 | 1.18 |
16 | 9 | MA0506.2 | Nrf1 | MA0506.2 | SNCTGCGCMTGCGCV | Nrf1 | 3.22e-25 | 1.46e-26 | 4410 | 20.5 | 8580 | 17 | 1.2 |
17 | 4 | MA1653.1 | ZNF148 | MA1653.1 | YBCCCCTCCCCC | ZNF148 | 4.54e-25 | 1.94e-26 | 6350 | 29.5 | 12900 | 25.6 | 1.15 |
18 | 4 | MA1965.1 | SP5 | MA1965.1 | CBCCTCCCHS | SP5 | 6.07e-25 | 2.45e-26 | 9790 | 45.5 | 20700 | 41.1 | 1.11 |
19 | 4 | MA1564.1 | SP9 | MA1564.1 | RCCACGCCCMCY | SP9 | 1.19e-24 | 4.54e-26 | 10300 | 48 | 22000 | 43.6 | 1.1 |
20 | 10 | MA0811.1 | TFAP2B | MA0811.1 | YGCCCBVRGGCA | TFAP2B | 1.9e-24 | 6.89e-26 | 8270 | 38.4 | 17300 | 34.2 | 1.12 |
21 | 11 | MA0162.4 | EGR1 | MA0162.4 | VCMCGCCCACGCVS | EGR1 | 2.14e-24 | 7.41e-26 | 4390 | 20.4 | 8570 | 17 | 1.2 |
22 | 12 | MA1986.1 | ZNF692 | MA1986.1 | BGGGCCCASN | ZNF692 | 6.1e-24 | 2.02e-25 | 8440 | 39.2 | 17700 | 35 | 1.12 |
23 | 10 | MA0524.2 | TFAP2C | MA0524.2 | YGCCYBVRGGCA | TFAP2C | 8.82e-24 | 2.79e-25 | 8890 | 41.3 | 18700 | 37.1 | 1.11 |
24 | 10 | MA0810.1 | TFAP2A | MA0810.1 | YGCCCBVRGGCR | TFAP2A | 1.1e-23 | 3.33e-25 | 7950 | 36.9 | 16600 | 32.8 | 1.13 |
25 | 4 | MA0746.2 | SP3 | MA0746.2 | NGCCACGCCCMCH | SP3 | 1.41e-23 | 4.1e-25 | 4420 | 20.5 | 8660 | 17.2 | 1.2 |
26 | 13 | MA1976.1 | ZNF320 | MA1976.1 | DGKGGGRCCMRGGGGCCWGTGHVNNNN | ZNF320 | 2.39e-23 | 6.68e-25 | 6680 | 31 | 13700 | 27.1 | 1.14 |
27 | 4 | MA1515.1 | KLF2 | MA1515.1 | NRCCACRCCCH | KLF2 | 7.79e-23 | 2.1e-24 | 6730 | 31.2 | 13800 | 27.4 | 1.14 |
28 | 4 | MA0516.3 | SP2 | MA0516.3 | GGGGCGGGG | SP2 | 4.25e-22 | 1.1e-23 | 2190 | 10.2 | 3930 | 7.79 | 1.3 |
29 | 4 | MA0753.2 | ZNF740 | MA0753.2 | CYRCCCCCCCCAC | ZNF740 | 1.5e-21 | 3.77e-23 | 7310 | 34 | 15200 | 30.1 | 1.13 |
30 | 14 | MA1102.2 | CTCFL | MA1102.2 | NSCAGGGGGCGS | CTCFL | 4.87e-21 | 1.18e-22 | 3520 | 16.3 | 6780 | 13.4 | 1.22 |
31 | 15 | MA0024.3 | E2F1 | MA0024.3 | TTTGGCGCCAAA | E2F1 | 6.38e-21 | 1.5e-22 | 5160 | 24 | 10400 | 20.6 | 1.16 |
32 | 15 | MA1615.1 | Plagl1 | MA1615.1 | NNCTGGGGCCABN | Plagl1 | 2.3e-20 | 5.22e-22 | 7270 | 33.8 | 15100 | 30 | 1.12 |
33 | 4 | MA1516.1 | KLF3 | MA1516.1 | GRCCRCGCCCH | KLF3 | 2.81e-20 | 6.19e-22 | 5690 | 26.4 | 11600 | 23 | 1.15 |
34 | 4 | MA0599.1 | KLF5 | MA0599.1 | GCCCCDCCCH | KLF5 | 5.09e-20 | 1.09e-21 | 5510 | 25.6 | 11200 | 22.2 | 1.15 |
35 | 16 | MA0812.1 | TFAP2B | MA0812.1 | HGCCTSAGGCD | TFAP2B | 1.27e-19 | 2.64e-21 | 8590 | 39.9 | 18200 | 36.1 | 1.11 |
36 | 16 | MA0003.4 | TFAP2A | MA0003.4 | NNKGCCTCAGGCNN | TFAP2A | 2.78e-19 | 5.62e-21 | 6570 | 30.5 | 13600 | 27 | 1.13 |
37 | 4 | MA1517.1 | KLF6 | MA1517.1 | NRCCACGCCCH | KLF6 | 3.68e-19 | 7.24e-21 | 6830 | 31.7 | 14200 | 28.2 | 1.13 |
38 | 17 | MA0823.1 | HEY1 | MA0823.1 | GRCACGTGCC | HEY1 | 4.89e-19 | 9.35e-21 | 6860 | 31.8 | 14300 | 28.3 | 1.13 |
39 | 4 | MA0685.2 | SP4 | MA0685.2 | GGGGCGGGG | SP4 | 9.53e-19 | 1.78e-20 | 4580 | 21.2 | 9180 | 18.2 | 1.17 |
40 | 4 | MA0742.2 | KLF12 | MA0742.2 | GGGGCGGGG | KLF12 | 1.91e-18 | 3.47e-20 | 2240 | 10.4 | 4140 | 8.2 | 1.27 |
41 | 18 | MA0131.2 | HINFP | MA0131.2 | CARCGTCCGCKN | HINFP | 2.68e-18 | 4.75e-20 | 4610 | 21.4 | 9270 | 18.4 | 1.17 |
42 | 7 | MA1583.1 | ZFP57 | MA1583.1 | NNVTTGCCGCANN | ZFP57 | 3.41e-18 | 5.91e-20 | 4730 | 22 | 9550 | 18.9 | 1.16 |
43 | 19 | MA1712.1 | ZNF454 | MA1712.1 | DGGCTCSSGGCCCYGVKG | ZNF454 | 4.96e-18 | 8.39e-20 | 3710 | 17.2 | 7310 | 14.5 | 1.19 |
44 | 20 | MA1721.1 | ZNF93 | MA1721.1 | GGYAGCRGCAGCGGYG | ZNF93 | 2.14e-17 | 3.54e-19 | 9140 | 42.4 | 19600 | 38.8 | 1.09 |
45 | 4 | MA0039.4 | KLF4 | MA0039.4 | NNCCCCACCCHN | KLF4 | 2.42e-17 | 3.91e-19 | 11600 | 53.9 | 25300 | 50.2 | 1.07 |
46 | 11 | MA0733.1 | EGR4 | MA0733.1 | NHMCGCCCACGCAHWT | EGR4 | 2.57e-17 | 4.06e-19 | 9150 | 42.5 | 19600 | 38.8 | 1.09 |
47 | 4 | MA0493.2 | KLF1 | MA0493.2 | DGGGYGGGG | KLF1 | 3.22e-17 | 4.98e-19 | 3550 | 16.5 | 6980 | 13.8 | 1.19 |
48 | 21 | MA0527.1 | ZBTB33 | MA0527.1 | BTCTCGCGAGABYTS | ZBTB33 | 4.17e-17 | 6.31e-19 | 5000 | 23.2 | 10200 | 20.2 | 1.15 |
49 | 22 | MA0751.1 | ZIC4 | MA0751.1 | GRCCCCCCGCKGYGH | ZIC4 | 6.92e-17 | 1.02e-18 | 7670 | 35.6 | 16200 | 32.2 | 1.11 |
50 | 7 | MA0865.2 | E2F8 | MA0865.2 | TTCCCGCCAHWA | E2F8 | 7.03e-17 | 1.02e-18 | 3930 | 18.2 | 7820 | 15.5 | 1.18 |
51 | 11 | MA1627.1 | Wt1 | MA1627.1 | YBCCTCCCCCACVB | Wt1 | 8.02e-17 | 1.14e-18 | 4140 | 19.2 | 8300 | 16.4 | 1.17 |
52 | 13 | MA1723.1 | PRDM9 | MA1723.1 | RGDGGGVAGGGRGGVRRMARVARR | PRDM9 | 1.07e-16 | 1.5e-18 | 6920 | 32.1 | 14500 | 28.8 | 1.12 |
53 | 11 | MA0732.1 | EGR3 | MA0732.1 | HHMCGCCCACGCAHH | EGR3 | 4.59e-16 | 6.29e-18 | 3570 | 16.6 | 7070 | 14 | 1.18 |
54 | 23 | MA0067.2 | PAX2 | MA0067.2 | BCAVTSRAGCGTGACGR | PAX2 | 4.9e-16 | 6.59e-18 | 5520 | 25.6 | 11400 | 22.6 | 1.14 |
55 | 24 | MA1548.1 | PLAGL2 | MA1548.1 | NGGGCCCCCN | PLAGL2 | 5.28e-16 | 6.98e-18 | 5390 | 25 | 11100 | 22 | 1.14 |
56 | 25 | MA1973.1 | ZKSCAN3 | MA1973.1 | KDKCCCAGGCTAGCCCABNN | ZKSCAN3 | 6.11e-16 | 7.94e-18 | 7770 | 36.1 | 16500 | 32.7 | 1.1 |
57 | 15 | MA0864.2 | E2F2 | MA0864.2 | RWTTTGGCGCCAWWWY | E2F2 | 7.08e-16 | 9.03e-18 | 4150 | 19.3 | 8350 | 16.6 | 1.17 |
58 | 23 | MA0014.3 | PAX5 | MA0014.3 | RRGCGTGACCNN | PAX5 | 1.1e-15 | 1.38e-17 | 6930 | 32.2 | 14600 | 28.9 | 1.11 |
59 | 4 | MA0740.2 | KLF14 | MA0740.2 | KGGGCGGGG | KLF14 | 1.5e-15 | 1.85e-17 | 5600 | 26 | 11600 | 23 | 1.13 |
60 | 22 | MA1578.1 | VEZF1 | MA1578.1 | CCCCCCMYDH | VEZF1 | 2.25e-15 | 2.72e-17 | 9400 | 43.6 | 20300 | 40.2 | 1.09 |
61 | 4 | MA0741.1 | KLF16 | MA0741.1 | GMCACGCCCCC | KLF16 | 2.55e-15 | 3.04e-17 | 2640 | 12.2 | 5070 | 10 | 1.22 |
62 | 26 | MA0759.2 | ELK3 | MA0759.2 | ACCGGAAGTRV | ELK3 | 4.15e-15 | 4.86e-17 | 4330 | 20.1 | 8770 | 17.4 | 1.16 |
63 | 16 | MA0814.2 | TFAP2C | MA0814.2 | NNGCCTGAGGCGNN | TFAP2C | 7.01e-15 | 8.09e-17 | 3000 | 13.9 | 5860 | 11.6 | 1.2 |
64 | 27 | MA1599.1 | ZNF682 | MA1599.1 | BVGGCYAAGCCCCWNY | ZNF682 | 8.56e-15 | 9.72e-17 | 10700 | 49.7 | 23300 | 46.2 | 1.07 |
65 | 4 | MA1630.2 | ZNF281 | MA1630.2 | GGGGGAGGGGVV | ZNF281 | 9.06e-15 | 1.01e-16 | 3150 | 14.6 | 6210 | 12.3 | 1.19 |
66 | 4 | MA1511.2 | KLF10 | MA1511.2 | GGGGCGGGG | KLF10 | 3.2e-14 | 3.53e-16 | 851 | 3.95 | 1390 | 2.75 | 1.44 |
67 | 28 | MA1981.1 | ZNF530 | MA1981.1 | GMARGGMRAGGGGCRGSV | ZNF530 | 5.68e-14 | 6.16e-16 | 7330 | 34 | 15600 | 30.9 | 1.1 |
68 | 15 | MA0470.2 | E2F4 | MA0470.2 | TTTTGGCGCCAWWW | E2F4 | 6.05e-14 | 6.47e-16 | 2320 | 10.8 | 4440 | 8.81 | 1.23 |
69 | 29 | MA1985.1 | ZNF669 | MA1985.1 | GGGGYGAYGACCRCT | ZNF669 | 6.68e-14 | 7.03e-16 | 4960 | 23.1 | 10200 | 20.3 | 1.14 |
70 | 30 | MA1596.1 (ZNF460) | STREME-5 | 5-CCGCCTCC | CCGCCTCC | ZNF460 | 1.76e-13 | 1.83e-15 | 16900 | 78.5 | 38200 | 75.7 | 1.04 |
71 | 17 | MA0632.2 | TCFL5 | MA0632.2 | KCACGCGCMC | TCFL5 | 4.43e-13 | 4.54e-15 | 4520 | 21 | 9290 | 18.4 | 1.14 |
72 | 4 | MA1959.1 | KLF7 | MA1959.1 | DGGGCGGGG | KLF7 | 1.61e-12 | 1.63e-14 | 1190 | 5.54 | 2110 | 4.17 | 1.33 |
73 | 31 | MA0748.2 | YY2 | MA0748.2 | NVATGGCGGCS | YY2 | 2.15e-12 | 2.14e-14 | 2270 | 10.5 | 4370 | 8.66 | 1.22 |
74 | 13 | MA0163.1 | PLAG1 | MA0163.1 | GGGGCCCWAGGGGG | PLAG1 | 2.26e-12 | 2.22e-14 | 2620 | 12.2 | 5130 | 10.2 | 1.2 |
75 | 32 | MA1728.1 | ZNF549 | MA1728.1 | NNTGCTGCCCWR | ZNF549 | 3.31e-12 | 3.21e-14 | 8280 | 38.5 | 17900 | 35.4 | 1.09 |
76 | 13 | MA1930.1 | CTCF | MA1930.1 | CTGCAGTKCCNVCHNNYRGCCASYAGRKGGCRAKC | CTCF | 4.3e-12 | 4.11e-14 | 5350 | 24.9 | 11200 | 22.2 | 1.12 |
77 | 11 | MA0472.2 | EGR2 | MA0472.2 | MCGCCCACGCA | EGR2 | 4.81e-12 | 4.54e-14 | 8680 | 40.3 | 18800 | 37.3 | 1.08 |
78 | 17 | MA0821.2 | HES5 | MA0821.2 | GRCACGTGYC | HES5 | 6.91e-12 | 6.44e-14 | 12100 | 56.1 | 26700 | 53 | 1.06 |
79 | 17 | MA1648.1 | TCF12 | MA1648.1 | NNCACCTGCNN | TCF12 | 8.92e-12 | 8.21e-14 | 4430 | 20.6 | 9140 | 18.1 | 1.14 |
80 | 33 | MA0695.1 | ZBTB7C | MA0695.1 | RCGACCACCGAN | ZBTB7C | 1.44e-11 | 1.31e-13 | 13400 | 62.2 | 29900 | 59.2 | 1.05 |
81 | 34 | MA0145.2 | Tfcp2l1 | MA0145.2 | CCAGYYYVADCCRG | Tfcp2l1 | 2.86e-11 | 2.57e-13 | 15000 | 69.5 | 33700 | 66.7 | 1.04 |
82 | 33 | MA1990.1 | Gli1 | MA1990.1 | NNRGACCACCCASV | Gli1 | 4.45e-11 | 3.94e-13 | 10500 | 48.8 | 23100 | 45.8 | 1.07 |
83 | 17 | MA1106.1 | HIF1A | MA1106.1 | SBACGTGCNN | HIF1A | 5.36e-11 | 4.7e-13 | 11300 | 52.5 | 25000 | 49.5 | 1.06 |
84 | 17 | MA0830.2 | TCF4 | MA0830.2 | NNGCACCTGCCNN | TCF4 | 1.5e-10 | 1.3e-12 | 3740 | 17.4 | 7670 | 15.2 | 1.14 |
85 | 11 | MA1107.2 | KLF9 | MA1107.2 | MNGCCACACCCACHYV | KLF9 | 2.31e-10 | 1.98e-12 | 14600 | 67.8 | 32800 | 65 | 1.04 |
86 | 26 | MA0750.2 | ZBTB7A | MA0750.2 | NVCCGGAAGTGSV | ZBTB7A | 2.54e-10 | 2.15e-12 | 16100 | 74.6 | 36300 | 72 | 1.04 |
87 | 4 | MA1512.1 | KLF11 | MA1512.1 | SCCACGCCCMC | KLF11 | 3.94e-10 | 3.29e-12 | 1580 | 7.32 | 2970 | 5.89 | 1.24 |
88 | 4 | MA0747.1 | SP8 | MA0747.1 | RCCACGCCCMCY | SP8 | 4.73e-10 | 3.91e-12 | 2380 | 11.1 | 4700 | 9.32 | 1.19 |
89 | 35 | MA1581.1 | ZBTB6 | MA1581.1 | DNSCTTGAGCCNV | ZBTB6 | 8.73e-10 | 7.13e-12 | 14000 | 65.1 | 31500 | 62.4 | 1.04 |
90 | 36 | 6-CCGAGA | STREME-6 | 6-CCGAGA | CCGAGA | CCGAGA | 1.66e-09 | 1.34e-11 | 10500 | 49 | 23300 | 46.1 | 1.06 |
91 | 26 | MA0764.3 | ETV4 | MA0764.3 | ACCGGAAGTR | ETV4 | 1.86e-09 | 1.49e-11 | 15300 | 71.2 | 34600 | 68.6 | 1.04 |
92 | 10 | MA1569.1 | TFAP2E | MA1569.1 | YGCCYSRGGCD | TFAP2E | 3.26e-09 | 2.58e-11 | 1180 | 5.48 | 2160 | 4.29 | 1.28 |
93 | 26 | MA0765.3 | ETV5 | MA0765.3 | ACCGGAAGTR | ETV5 | 5.98e-09 | 4.68e-11 | 6100 | 28.3 | 13100 | 25.9 | 1.09 |
94 | 17 | MA1099.2 | HES1 | MA1099.2 | GGCRCGTGBC | HES1 | 6.68e-09 | 5.16e-11 | 472 | 2.19 | 740 | 1.47 | 1.5 |
95 | 37 | MA1710.1 | ZNF257 | MA1710.1 | VDGAGGCRAGRG | ZNF257 | 8.39e-09 | 6.41e-11 | 10900 | 50.5 | 24100 | 47.8 | 1.06 |
96 | 26 | MA0156.3 | FEV | MA0156.3 | VACCGGAAGTVV | FEV | 8.47e-09 | 6.41e-11 | 8140 | 37.8 | 17800 | 35.2 | 1.07 |
97 | 17 | MA0616.2 | HES2 | MA0616.2 | GGCACGTGYC | HES2 | 1.38e-08 | 1.04e-10 | 7710 | 35.8 | 16800 | 33.2 | 1.08 |
98 | 22 | MA0736.1 | GLIS2 | MA0736.1 | GACCCCCCGCRAMG | GLIS2 | 1.51e-08 | 1.12e-10 | 4020 | 18.7 | 8400 | 16.6 | 1.12 |
99 | 17 | MA0649.1 | HEY2 | MA0649.1 | GRCACGTGYC | HEY2 | 3.32e-08 | 2.44e-10 | 5500 | 25.6 | 11800 | 23.3 | 1.1 |
100 | 38 | MA1972.1 | ZFP14 | MA1972.1 | GGAGGCHCHGGARBG | ZFP14 | 3.69e-08 | 2.68e-10 | 9430 | 43.8 | 20800 | 41.2 | 1.06 |
101 | 39 | MA1982.1 | ZNF574 | MA1982.1 | VGSCTAGAGMGGCCS | ZNF574 | 4.75e-08 | 3.42e-10 | 8030 | 37.3 | 17500 | 34.8 | 1.07 |
102 | 11 | MA1656.1 | ZNF449 | MA1656.1 | NNAAGCCCAACCNN | ZNF449 | 7.16e-08 | 5.1e-10 | 17300 | 80.6 | 39600 | 78.5 | 1.03 |
103 | 15 | MA1727.1 | ZNF417 | MA1727.1 | VRBVNTGGGCGCCAM | ZNF417 | 1.17e-07 | 8.23e-10 | 3300 | 15.3 | 6830 | 13.5 | 1.13 |
104 | 33 | MA0694.1 | ZBTB7B | MA0694.1 | RCGACCACCGAA | ZBTB7B | 1.77e-07 | 1.24e-09 | 4830 | 22.4 | 10300 | 20.4 | 1.1 |
105 | 17 | MA0006.1 | Ahr::Arnt | MA0006.1 | YGCGTG | Ahr::Arnt | 1.98e-07 | 1.37e-09 | 5700 | 26.5 | 12300 | 24.3 | 1.09 |
106 | 17 | MA1941.1 | ETV2::FIGLA | MA1941.1 | RSCGGAARCAGSTGBN | ETV2::FIGLA | 2.36e-07 | 1.61e-09 | 13900 | 64.6 | 31400 | 62.2 | 1.04 |
107 | 14 | MA0155.1 | INSM1 | MA0155.1 | TGYCWGGGGGCR | INSM1 | 2.37e-07 | 1.61e-09 | 5870 | 27.3 | 12600 | 25.1 | 1.09 |
108 | 22 | MA0696.1 | ZIC1 | MA0696.1 | GACCCCCYGCTGTG | ZIC1 | 2.63e-07 | 1.77e-09 | 4750 | 22.1 | 10100 | 20 | 1.1 |
109 | 17 | MA0259.1 | ARNT::HIF1A | MA0259.1 | VBACGTGC | ARNT::HIF1A | 2.65e-07 | 1.77e-09 | 3990 | 18.5 | 8390 | 16.6 | 1.12 |
110 | 17 | MA0522.3 | TCF3 | MA0522.3 | NVCACCTGCNN | TCF3 | 5.45e-07 | 3.6e-09 | 3760 | 17.4 | 7880 | 15.6 | 1.12 |
111 | 40 | MA1587.1 | ZNF135 | MA1587.1 | CCTCGACCTCCYRR | ZNF135 | 7.18e-07 | 4.7e-09 | 5590 | 26 | 12000 | 23.8 | 1.09 |
112 | 22 | MA1584.1 | ZIC5 | MA1584.1 | VGACCCCCCGCTGHGM | ZIC5 | 9.7e-07 | 6.3e-09 | 1900 | 8.8 | 3770 | 7.48 | 1.18 |
113 | 41 | MA1730.1 | ZNF708 | MA1730.1 | NNRGCTGTRCCTCCY | ZNF708 | 1.17e-06 | 7.55e-09 | 13000 | 60.2 | 29200 | 57.8 | 1.04 |
114 | 42 | MA0511.2 | RUNX2 | MA0511.2 | WAACCGCAA | RUNX2 | 2.66e-06 | 1.7e-08 | 14900 | 69.3 | 33800 | 67.1 | 1.03 |
115 | 17 | MA1997.1 | Olig2 | MA1997.1 | NVCAGCTGBN | Olig2 | 3.17e-06 | 2e-08 | 9500 | 44.1 | 21100 | 41.8 | 1.06 |
116 | 43 | MA1555.1 | RXRB | MA1555.1 | RRGGTCATGACCYY | RXRB | 3.46e-06 | 2.17e-08 | 7480 | 34.8 | 16400 | 32.5 | 1.07 |
117 | 17 | MA1631.1 | ASCL1 | MA1631.1 | NNGCACCTGCYNB | ASCL1 | 5.98e-06 | 3.71e-08 | 3520 | 16.3 | 7400 | 14.7 | 1.11 |
118 | 44 | MA0597.2 | THAP1 | MA0597.2 | VSGCAGGGCASV | THAP1 | 9.02e-06 | 5.56e-08 | 4070 | 18.9 | 8650 | 17.1 | 1.1 |
119 | 17 | MA0464.2 | BHLHE40 | MA0464.2 | VKCACGTGMC | BHLHE40 | 1.03e-05 | 6.27e-08 | 9930 | 46.1 | 22100 | 43.9 | 1.05 |
120 | 45 | MA0615.1 | Gmeb1 | MA0615.1 | BHBBKKACGTMMNWNNN | Gmeb1 | 1.18e-05 | 7.12e-08 | 13600 | 63.2 | 30800 | 61 | 1.04 |
121 | 43 | MA1149.1 | RARA::RXRG | MA1149.1 | RRGGTCAHNNNRRGGTCA | RARA::RXRG | 1.76e-05 | 1.06e-07 | 17900 | 82.9 | 41000 | 81.2 | 1.02 |
122 | 46 | MA1971.1 | ZBED2 | MA1971.1 | NNMMCGAAACCNNV | ZBED2 | 1.88e-05 | 1.12e-07 | 5290 | 24.6 | 11400 | 22.7 | 1.08 |
123 | 17 | MA0147.3 | MYC | MA0147.3 | NNCCACGTGCNB | MYC | 3.55e-05 | 2.1e-07 | 7670 | 35.6 | 16900 | 33.6 | 1.06 |
124 | 17 | MA1560.1 | SOHLH2 | MA1560.1 | YGCACGTGCN | SOHLH2 | 3.63e-05 | 2.13e-07 | 3090 | 14.4 | 6500 | 12.9 | 1.12 |
125 | 47 | MA0682.2 (PITX1) | STREME-4 | 4-TGGGATTACAGGM | TGGGATTACAGGM | PITX1 | 4.43e-05 | 2.58e-07 | 509 | 2.36 | 885 | 1.75 | 1.35 |
126 | 48 | MA1542.1 | OSR1 | MA1542.1 | HGCTACYGTD | OSR1 | 6.61e-05 | 3.81e-07 | 16300 | 75.6 | 37200 | 73.7 | 1.03 |
127 | 17 | MA1464.1 | ARNT2 | MA1464.1 | GTCACGTGMN | ARNT2 | 7.92e-05 | 4.53e-07 | 5310 | 24.7 | 11500 | 22.9 | 1.08 |
128 | 14 | MA0139.1 | CTCF | MA0139.1 | YGRCCASYAGRKGGCRSYR | CTCF | 0.000135 | 7.66e-07 | 5300 | 24.6 | 11500 | 22.8 | 1.08 |
129 | 17 | MA1993.1 | Neurod2 | MA1993.1 | VVCAGCTGBB | Neurod2 | 0.000178 | 1e-06 | 9540 | 44.3 | 21300 | 42.3 | 1.05 |
130 | 49 | MA1575.1 | THRB | MA1575.1 | DTGACCTBRNYVAGGTCAH | THRB | 0.000204 | 1.14e-06 | 3090 | 14.3 | 6520 | 12.9 | 1.11 |
131 | 17 | MA0633.2 | Twist2 | MA0633.2 | NVCAGCTGBN | Twist2 | 0.000218 | 1.21e-06 | 9510 | 44.2 | 21300 | 42.1 | 1.05 |
132 | 17 | MA1966.1 | TFAP4::ETV1 | MA1966.1 | RSCGGAAGCAGSTGKN | TFAP4::ETV1 | 0.000228 | 1.26e-06 | 15900 | 74 | 36500 | 72.2 | 1.03 |
133 | 35 | MA1649.1 | ZBTB12 | MA1649.1 | NYCTRGAACHN | ZBTB12 | 0.000253 | 1.38e-06 | 16700 | 77.8 | 38400 | 76 | 1.02 |
134 | 33 | MA0734.3 | Gli2 | MA0734.3 | NRGACCACCCASV | Gli2 | 0.000256 | 1.39e-06 | 3290 | 15.3 | 6990 | 13.9 | 1.1 |
135 | 50 | MA1652.1 | ZKSCAN5 | MA1652.1 | NNRGGARGTGAGRR | ZKSCAN5 | 3e-04 | 1.61e-06 | 16200 | 75.2 | 37100 | 73.5 | 1.02 |
136 | 51 | MA1684.1 | Foxn1 | MA1684.1 | RGAMGC | Foxn1 | 0.00044 | 2.35e-06 | 639 | 2.97 | 1180 | 2.33 | 1.27 |
137 | 4 | MA0528.2 | ZNF263 | MA0528.2 | VNGGGGAGGASG | ZNF263 | 0.000537 | 2.85e-06 | 18100 | 84 | 41600 | 82.5 | 1.02 |
138 | 17 | MA0603.1 | Arntl | MA0603.1 | NGTCACGTGH | Arntl | 0.000551 | 2.9e-06 | 6200 | 28.8 | 13600 | 27 | 1.07 |
139 | 17 | MA0104.4 | MYCN | MA0104.4 | VVCCACGTGGBB | MYCN | 0.000576 | 3.01e-06 | 9260 | 43 | 20700 | 41.1 | 1.05 |
140 | 47 | MA0682.2 (PITX1) | MEME-2 | WGCTGGGATTACAGG | WGCTGGGATTACAGG | PITX1 | 0.00112 | 5.82e-06 | 862 | 4 | 1660 | 3.29 | 1.22 |
141 | 52 | MA1628.1 | Zic1::Zic2 | MA1628.1 | CVCAGCAGGNV | Zic1::Zic2 | 0.00127 | 6.56e-06 | 4030 | 18.7 | 8700 | 17.2 | 1.08 |
142 | 53 | MA1557.1 | SMAD5 | MA1557.1 | YGTCTAGACA | SMAD5 | 0.00156 | 8.01e-06 | 11200 | 52.1 | 25400 | 50.3 | 1.04 |
143 | 43 | MA1147.1 | NR4A2::RXRA | MA1147.1 | VRGGTCRTTGACCYY | NR4A2::RXRA | 0.00158 | 8.03e-06 | 8220 | 38.2 | 18400 | 36.4 | 1.05 |
144 | 54 | MA1711.1 | ZNF343 | MA1711.1 | CCGCTTCMCCDCGGCMV | ZNF343 | 0.00232 | 1.17e-05 | 4140 | 19.2 | 8990 | 17.8 | 1.08 |
145 | 55 | MA0646.1 | GCM1 | MA0646.1 | BATGCGGGTAC | GCM1 | 0.00242 | 1.2e-05 | 13600 | 63.4 | 31100 | 61.6 | 1.03 |
146 | 56 | MA1514.1 | KLF17 | MA1514.1 | MMCCACGCACCCMTY | KLF17 | 0.00242 | 1.2e-05 | 1880 | 8.75 | 3900 | 7.73 | 1.13 |
147 | 3 | MA1529.1 | NHLH2 | MA1529.1 | GGGNCGCAGCTGCGYCCC | NHLH2 | 0.00256 | 1.26e-05 | 4780 | 22.2 | 10400 | 20.7 | 1.07 |
148 | 47 | MA0142.1 (Pou5f1::Sox2) | STREME-8 | 8-ATCTGCACTCCCAWG | ATCTGCACTCCCAWG | Pou5f1::Sox2 | 0.00259 | 1.27e-05 | 43 | 0.2 | 35 | 0.07 | 2.86 |
149 | 17 | MA0103.3 | ZEB1 | MA0103.3 | SNCACCTGSVN | ZEB1 | 0.00275 | 1.34e-05 | 5670 | 26.4 | 12500 | 24.8 | 1.06 |
150 | 17 | MA0636.1 | BHLHE41 | MA0636.1 | RTCACGTGAC | BHLHE41 | 0.00339 | 1.64e-05 | 835 | 3.88 | 1620 | 3.21 | 1.21 |
151 | 17 | MA0608.1 | Creb3l2 | MA0608.1 | GCCACGTGD | Creb3l2 | 0.00382 | 1.84e-05 | 2650 | 12.3 | 5630 | 11.2 | 1.1 |
152 | 57 | MA1604.1 | Ebf2 | MA1604.1 | NYCCCAAGGGANN | Ebf2 | 0.00394 | 1.88e-05 | 6070 | 28.2 | 13400 | 26.6 | 1.06 |
153 | 26 | MA0641.1 | ELF4 | MA0641.1 | AACCCGGAAGTR | ELF4 | 0.00429 | 2.03e-05 | 3980 | 18.5 | 8640 | 17.1 | 1.08 |
154 | 17 | MA1945.1 | ETV5::FIGLA | MA1945.1 | RGCGGAARCAGGTGG | ETV5::FIGLA | 0.00431 | 2.03e-05 | 4650 | 21.6 | 10200 | 20.1 | 1.07 |
155 | 22 | MA0697.2 | Zic3 | MA0697.2 | CNCAGCAGGAGNN | Zic3 | 0.00489 | 2.29e-05 | 4490 | 20.8 | 9790 | 19.4 | 1.07 |
156 | 26 | MA1992.1 | Ikzf3 | MA1992.1 | NNRCAGGAAGTGGVN | Ikzf3 | 0.00549 | 2.56e-05 | 12700 | 59.1 | 28900 | 57.4 | 1.03 |
157 | 58 | MA2002.1 | Zfp335 | MA2002.1 | VNTCAGGCANH | Zfp335 | 0.00854 | 3.95e-05 | 4860 | 22.6 | 10700 | 21.2 | 1.07 |
158 | 55 | MA0767.1 | GCM2 | MA0767.1 | BATGCGGGTR | GCM2 | 0.00949 | 4.37e-05 | 7600 | 35.3 | 17000 | 33.7 | 1.05 |
159 | 59 | MA0504.1 | NR2C2 | MA0504.1 | VGRGGTCARAGGTCA | NR2C2 | 0.0105 | 4.79e-05 | 9210 | 42.8 | 20700 | 41.1 | 1.04 |
160 | 57 | MA0154.4 | EBF1 | MA0154.4 | NNTCCCCAGGGGANN | EBF1 | 0.0112 | 5.09e-05 | 4430 | 20.6 | 9700 | 19.2 | 1.07 |
161 | 17 | MA0822.1 | HES7 | MA0822.1 | YGGCACGTGCCR | HES7 | 0.0128 | 5.76e-05 | 373 | 1.73 | 665 | 1.32 | 1.32 |
162 | 60 | MA1715.1 | ZNF707 | MA1715.1 | CCCCACTCCTGGTAC | ZNF707 | 0.0141 | 6.28e-05 | 6650 | 30.9 | 14800 | 29.3 | 1.05 |
163 | 61 | MA0738.1 | HIC2 | MA0738.1 | RTGCCCRSB | HIC2 | 0.0141 | 6.28e-05 | 1310 | 6.08 | 2680 | 5.3 | 1.15 |
164 | 62 | MA1964.1 | SMAD2 | MA1964.1 | NBCCAGACDN | SMAD2 | 0.0145 | 6.44e-05 | 16500 | 76.6 | 37900 | 75.2 | 1.02 |
165 | 43 | MA1535.1 | NR2C1 | MA1535.1 | NRRGGTCAN | NR2C1 | 0.0149 | 6.57e-05 | 15900 | 73.6 | 36400 | 72.1 | 1.02 |
166 | 63 | MA1554.1 | RFX7 | MA1554.1 | CGTTGCYAY | RFX7 | 0.0155 | 6.8e-05 | 17800 | 82.5 | 41000 | 81.2 | 1.02 |
167 | 17 | MA1493.1 | HES6 | MA1493.1 | GGCACGTGTY | HES6 | 0.0321 | 0.00014 | 9020 | 41.9 | 20400 | 40.3 | 1.04 |
168 | 45 | MA1475.1 | CREB3L4 | MA1475.1 | GRTGACGTCAYC | CREB3L4 | 0.033 | 0.000143 | 14400 | 67 | 33100 | 65.5 | 1.02 |
11 | 65 | MA1125.1 (ZNF384) | MEME-1 | TTTTTTTTTWTTTTT | TTTTTTTTTWTTTTT | ZNF384 | 0.036 | 0.00328 | 734 | 3.41 | 1520 | 3.01 | 1.13 |
169 | 26 | MA0098.3 | ETS1 | MA0098.3 | ACCGGAARTR | ETS1 | 0.0396 | 0.000171 | 12700 | 58.8 | 28900 | 57.3 | 1.03 |
170 | 17 | MA0499.2 | MYOD1 | MA0499.2 | NNGCACCTGTCNB | MYOD1 | 0.0424 | 0.000181 | 16900 | 78.5 | 38900 | 77.1 | 1.02 |
171 | 64 | MA1616.1 | Prdm15 | MA1616.1 | NRGAAAACCTGGAVN | Prdm15 | 0.0431 | 0.000183 | 19100 | 88.7 | 44200 | 87.6 | 1.01 |
12 | 66 | MA1135.1 (FOSB::JUNB) | MEME-3 | ATGACTCA | ATGACTCA | FOSB::JUNB | 12 | 1 | 641 | 2.98 | 1820 | 3.61 | 0.826 |
DNR_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DNR_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
DNR_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*2), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./2,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("DNR_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
MTX_3_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_3_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
MTX_3_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_3_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
MTX_3_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_3_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
MTX_3_DAR_DEG_sea_all <- rbind(MTX_3_DAR_DEG_sea_disc, MTX_3_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
MTX_3_DAR_DEG_combo <- MTX_3_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., MTX_3_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
MTX_3_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5 | 5 | 5-AGAGGARGGAGATRA | STREME-5 | 5-AGAGGARGGAGATRA | AGAGGARGGAGATRA | AGAGGARGGAGATRA | 0.000325 | 6.51e-05 | 2 | 28.6 | 0 | 0 | 299 |
1 | 1 | MA1629.1 (Zic2) | STREME-1 | 1-TCCCAGCASA | TCCCAGCASA | Zic2 | 0.00162 | 0.00148 | 5 | 71.4 | 21 | 2.63 | 27.2 |
2 | 2 | 3-TAGGCCAS | STREME-3 | 3-TAGGCCAS | TAGGCCAS | TAGGCCAS | 0.0254 | 0.0101 | 4 | 57.1 | 15 | 1.88 | 31.2 |
3 | 3 | MA1557.1 (SMAD5) | STREME-2 | 2-CTAGCCA | CTAGCCA | SMAD5 | 0.0477 | 0.0101 | 6 | 85.7 | 82 | 10.3 | 8.41 |
4 | 4 | MA0686.1 (SPDEF) | STREME-4 | 4-AGATCC | AGATCC | SPDEF | 0.0493 | 0.0101 | 7 | 100 | 150 | 18.8 | 5.28 |
MTX_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("MTX_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
MTX_3_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*.05), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./.05,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("MTX_3_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
MTX_24_DAR_DEG_sea_disc <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_24_DEG_xstreme/sea_disc_out/sea.tsv") %>% mutate(source="disc") %>% slice_head(n = length(.$ID)-3)
MTX_24_DAR_DEG_sea_known <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_24_DEG_xstreme/sea_out/sea.tsv") %>% mutate(source="known") %>% slice_head(n = length(.$ID)-3)
MTX_24_DAR_DEG_xstreme <- read.delim("C:/Users/renee/ATAC_folder/ATAC_meme_data/new_analysis/MTX_24_DEG_xstreme/xstreme.tsv") %>% slice_head(n = length(.$ID)-3)
MTX_24_DAR_DEG_sea_all <- rbind(MTX_24_DAR_DEG_sea_disc, MTX_24_DAR_DEG_sea_known) %>%
arrange(ID, desc(source == "known")) %>%
distinct(ID, .keep_all = TRUE)
MTX_24_DAR_DEG_combo <- MTX_24_DAR_DEG_xstreme %>%
dplyr::select(SEED_MOTIF:EVALUE_ACC,SIM_MOTIF) %>%
left_join(., MTX_24_DAR_DEG_sea_all, by=c("ID"="ID","ALT_ID"="ALT_ID")) %>%
mutate(motif_name=case_when(
str_detect(SIM_MOTIF, "\\(") ~ str_extract(SIM_MOTIF, "(?<=\\().+?(?=\\))"),
str_detect(SIM_MOTIF, "^MA\\d+\\.\\d+") ~ ALT_ID,
str_detect(SIM_MOTIF, "^\\d+-") ~ str_replace(SIM_MOTIF, "^\\d+-", ""),
TRUE ~ SIM_MOTIF
)) %>%
dplyr::select(RANK,CLUSTER,SIM_MOTIF,ALT_ID, ID,CONSENSUS.x,motif_name,EVALUE.y,QVALUE,TP:ENR_RATIO) %>%
arrange(.,EVALUE.y)
MTX_24_DAR_DEG_combo %>%
dplyr::mutate_if(is.numeric, funs(as.character(signif(., 3)))) %>%
kable(., caption = "Enriched motifs 200 bp in MTX v VEH") %>%
kable_paper("striped", full_width = TRUE) %>%
kable_styling(full_width = FALSE, font_size = 16) %>%
scroll_box(height = "500px")
RANK | CLUSTER | SIM_MOTIF | ALT_ID | ID | CONSENSUS.x | motif_name | EVALUE.y | QVALUE | TP | TP. | FP | FP. | ENR_RATIO |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 1 | MA1995.1 (Npas4) | STREME-1 | 1-GTCTCGATG | GTCTCGATG | Npas4 | 0.000272 | 0.00026 | 97 | 7.32 | 853 | 4.16 | 1.77 |
2 | 2 | MA1489.1 (FOXN3) | STREME-2 | 2-AGGAAACAAAACTK | AGGAAACAAAACTK | FOXN3 | 0.00525 | 0.00263 | 15 | 1.13 | 81 | 0.4 | 3.01 |
2 | 8 | MA0907.1 | HOXC13 | MA0907.1 | BCTCGTAAAAH | HOXC13 | 0.0208 | 0.00996 | 52 | 3.92 | 420 | 2.05 | 1.94 |
3 | 3 | MA1511.2 (KLF10) | MEME-4 | GSGCGGGGSSSGGGG | GSGCGGGGSSSGGGG | KLF10 | 0.0622 | 0.0207 | 120 | 9.05 | 1470 | 7.18 | 1.27 |
4 | 4 | MA1547.2 (PITX2) | MEME-5 | TAATCCCARCACTTT | TAATCCCARCACTTT | PITX2 | 0.339 | 0.0848 | 38 | 2.87 | 430 | 2.1 | 1.4 |
5 | 5 | MA1107.2 (KLF9) | MEME-1 | ACACACACACACACA | ACACACACACACACA | KLF9 | 1.03 | 0.207 | 32 | 2.41 | 395 | 1.93 | 1.29 |
6 | 6 | MA0476.1 (FOS) | MEME-2 | TGASTCAT | TGASTCAT | FOS | 3.51 | 0.569 | 22 | 1.66 | 323 | 1.58 | 1.1 |
7 | 7 | MA1125.1 (ZNF384) | MEME-3 | TTKTTTTTWTTTTTW | TTKTTTTTWTTTTTW | ZNF384 | 3.99 | 0.569 | 1 | 0.08 | 10 | 0.05 | 2.81 |
8 | 8 | MA0907.1 (HOXC13) | STREME-3 | 3-CTCGTAAAGTYS | CTCGTAAAGTYS | HOXC13 | 6.01 | 0.752 | 421 | 31.8 | 6690 | 32.6 | 0.975 |
MTX_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
slice_head(n=80)%>%
mutate(log10Evalue= log(EVALUE.y, base = 10)*(-1)) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*1.25), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./1.25,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("MTX_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
MTX_24_DAR_DEG_combo %>%
dplyr::filter(ENR_RATIO>ER_rat) %>%
dplyr::select(RANK,CLUSTER,motif_name,SIM_MOTIF,ALT_ID, ID,EVALUE.y,TP.)%>%
arrange(.,EVALUE.y) %>%
mutate(log10Evalue = -log10(if_else(EVALUE.y == 0, 1e-300, EVALUE.y))) %>%
distinct(CLUSTER,.keep_all = TRUE) %>%
slice_head(n=5) %>%
mutate(motif_name=if_else(str_starts(motif_name,"^Z*"),paste(motif_name, RANK, sep="_"), if_else(motif_name=="KLF9",paste(motif_name, RANK,sep="_"),motif_name)))%>%
ggplot(., aes (y= reorder(motif_name,log10Evalue))) +
geom_col(aes(x=log10Evalue)) +
geom_point(aes(x=`TP.`*.1), size =4)+
scale_x_continuous(expand=c (0,.125),sec.axis = sec_axis(transform= ~./.1,name="Percent of peaks with motif"))+
theme_classic()+
ylab("Enriched TF motif")+
ggtitle(paste("MTX_24_DAR_DEG_combo response peaks(200bp)Enrichment ratio:",ER_rat))
sessionInfo()
R version 4.4.2 (2024-10-31 ucrt)
Platform: x86_64-w64-mingw32/x64
Running under: Windows 11 x64 (build 26100)
Matrix products: default
locale:
[1] LC_COLLATE=English_United States.utf8
[2] LC_CTYPE=English_United States.utf8
[3] LC_MONETARY=English_United States.utf8
[4] LC_NUMERIC=C
[5] LC_TIME=English_United States.utf8
time zone: America/Chicago
tzcode source: internal
attached base packages:
[1] grid stats4 stats graphics grDevices utils datasets
[8] methods base
other attached packages:
[1] ggseqlogo_0.2
[2] universalmotif_1.24.2
[3] data.table_1.17.8
[4] BSgenome.Hsapiens.UCSC.hg38_1.4.5
[5] BSgenome_1.74.0
[6] BiocIO_1.16.0
[7] MotifDb_1.48.0
[8] Biostrings_2.74.1
[9] XVector_0.46.0
[10] JASPAR2022_0.99.8
[11] BiocFileCache_2.14.0
[12] dbplyr_2.5.0
[13] devtools_2.4.5
[14] usethis_3.1.0
[15] ggpubr_0.6.1
[16] BiocParallel_1.40.2
[17] Cormotif_1.52.0
[18] affy_1.84.0
[19] scales_1.4.0
[20] VennDiagram_1.7.3
[21] futile.logger_1.4.3
[22] gridExtra_2.3
[23] ggfortify_0.4.19
[24] edgeR_4.4.2
[25] limma_3.62.2
[26] rtracklayer_1.66.0
[27] org.Hs.eg.db_3.20.0
[28] TxDb.Hsapiens.UCSC.hg38.knownGene_3.20.0
[29] GenomicFeatures_1.58.0
[30] AnnotationDbi_1.68.0
[31] Biobase_2.66.0
[32] GenomicRanges_1.58.0
[33] GenomeInfoDb_1.42.3
[34] IRanges_2.40.1
[35] S4Vectors_0.44.0
[36] BiocGenerics_0.52.0
[37] ChIPseeker_1.42.1
[38] RColorBrewer_1.1-3
[39] broom_1.0.9
[40] kableExtra_1.4.0
[41] cowplot_1.2.0
[42] lubridate_1.9.4
[43] forcats_1.0.0
[44] stringr_1.5.1
[45] dplyr_1.1.4
[46] purrr_1.1.0
[47] readr_2.1.5
[48] tidyr_1.3.1
[49] tibble_3.3.0
[50] ggplot2_3.5.2
[51] tidyverse_2.0.0
[52] workflowr_1.7.1
loaded via a namespace (and not attached):
[1] fs_1.6.6
[2] matrixStats_1.5.0
[3] bitops_1.0-9
[4] enrichplot_1.26.6
[5] httr_1.4.7
[6] profvis_0.4.0
[7] tools_4.4.2
[8] backports_1.5.0
[9] R6_2.6.1
[10] lazyeval_0.2.2
[11] urlchecker_1.0.1
[12] withr_3.0.2
[13] preprocessCore_1.68.0
[14] cli_3.6.5
[15] textshaping_1.0.1
[16] formatR_1.14
[17] labeling_0.4.3
[18] sass_0.4.10
[19] Rsamtools_2.22.0
[20] systemfonts_1.2.3
[21] yulab.utils_0.2.0
[22] DOSE_4.0.1
[23] svglite_2.2.1
[24] R.utils_2.13.0
[25] dichromat_2.0-0.1
[26] sessioninfo_1.2.3
[27] plotrix_3.8-4
[28] rstudioapi_0.17.1
[29] RSQLite_2.4.2
[30] generics_0.1.4
[31] gridGraphics_0.5-1
[32] TxDb.Hsapiens.UCSC.hg19.knownGene_3.2.2
[33] gtools_3.9.5
[34] car_3.1-3
[35] GO.db_3.20.0
[36] Matrix_1.7-3
[37] abind_1.4-8
[38] R.methodsS3_1.8.2
[39] lifecycle_1.0.4
[40] whisker_0.4.1
[41] yaml_2.3.10
[42] carData_3.0-5
[43] SummarizedExperiment_1.36.0
[44] gplots_3.2.0
[45] qvalue_2.38.0
[46] SparseArray_1.6.2
[47] blob_1.2.4
[48] promises_1.3.3
[49] crayon_1.5.3
[50] miniUI_0.1.2
[51] ggtangle_0.0.7
[52] lattice_0.22-7
[53] KEGGREST_1.46.0
[54] pillar_1.11.0
[55] knitr_1.50
[56] fgsea_1.32.4
[57] rjson_0.2.23
[58] boot_1.3-31
[59] codetools_0.2-20
[60] fastmatch_1.1-6
[61] glue_1.8.0
[62] getPass_0.2-4
[63] ggfun_0.2.0
[64] remotes_2.5.0
[65] vctrs_0.6.5
[66] png_0.1-8
[67] treeio_1.30.0
[68] gtable_0.3.6
[69] cachem_1.1.0
[70] xfun_0.52
[71] S4Arrays_1.6.0
[72] mime_0.13
[73] statmod_1.5.0
[74] ellipsis_0.3.2
[75] nlme_3.1-168
[76] ggtree_3.14.0
[77] bit64_4.6.0-1
[78] filelock_1.0.3
[79] rprojroot_2.1.0
[80] bslib_0.9.0
[81] affyio_1.76.0
[82] KernSmooth_2.23-26
[83] splitstackshape_1.4.8
[84] colorspace_2.1-1
[85] DBI_1.2.3
[86] tidyselect_1.2.1
[87] processx_3.8.6
[88] bit_4.6.0
[89] compiler_4.4.2
[90] curl_6.4.0
[91] git2r_0.36.2
[92] xml2_1.3.8
[93] DelayedArray_0.32.0
[94] caTools_1.18.3
[95] callr_3.7.6
[96] digest_0.6.37
[97] rmarkdown_2.29
[98] htmltools_0.5.8.1
[99] pkgconfig_2.0.3
[100] MatrixGenerics_1.18.1
[101] fastmap_1.2.0
[102] rlang_1.1.6
[103] htmlwidgets_1.6.4
[104] UCSC.utils_1.2.0
[105] shiny_1.11.1
[106] farver_2.1.2
[107] jquerylib_0.1.4
[108] jsonlite_2.0.0
[109] GOSemSim_2.32.0
[110] R.oo_1.27.1
[111] RCurl_1.98-1.17
[112] magrittr_2.0.3
[113] Formula_1.2-5
[114] GenomeInfoDbData_1.2.13
[115] ggplotify_0.1.2
[116] patchwork_1.3.1
[117] Rcpp_1.1.0
[118] ape_5.8-1
[119] stringi_1.8.7
[120] MASS_7.3-65
[121] zlibbioc_1.52.0
[122] plyr_1.8.9
[123] pkgbuild_1.4.8
[124] parallel_4.4.2
[125] ggrepel_0.9.6
[126] splines_4.4.2
[127] hms_1.1.3
[128] locfit_1.5-9.12
[129] ps_1.9.1
[130] igraph_2.1.4
[131] ggsignif_0.6.4
[132] reshape2_1.4.4
[133] pkgload_1.4.0
[134] futile.options_1.0.1
[135] XML_3.99-0.18
[136] evaluate_1.0.4
[137] lambda.r_1.2.4
[138] BiocManager_1.30.26
[139] tzdb_0.5.0
[140] httpuv_1.6.16
[141] xtable_1.8-4
[142] restfulr_0.0.16
[143] tidytree_0.4.6
[144] rstatix_0.7.2
[145] later_1.4.2
[146] viridisLite_0.4.2
[147] aplot_0.2.8
[148] memoise_2.0.1
[149] GenomicAlignments_1.42.0
[150] timechange_0.3.0