CIRCpedia v3 is a comprehensively upgraded circular RNA research platform integrating 2,350 NGS and 63 TGS datasets spanning 20 species. It systematically annotates 2.6 million circular RNAs with conservation insights, cognate linear RNA expression profiles, and degradation dynamics. The update introduces four innovative modules: gene editing validation, therapeutic circRNA optimization, subcellular localization prediction, and multi-molecular interaction analysis. An enhanced interactive visualization suite enables dynamic data filtering, cross-dataset comparison, and real-time analytics. Besides, many additional functions are included in CIRCpedia v3 to support circRNA studies, including but not limited to: predicting ORFs, IRES activity, and m6A modification sites; designing divergent PCR primers and siRNAs; converting IDs across different circRNA databases.
A comprehensive collection of samples are collected from five publicly accessible repositories: RNA Atlas , ENCODE Project, Non-Human Primate Reference Transcriptome Resource (NHPRTR), NCBI Sequence Read Archive (SRA), and Genome Sequence Archive (GSA). Apart from whole organism samples of worm and fly across different developmental stages, all samples were manually curated and systematically assigned into 26 organ systems based on metadata from source studies or public repositories. (Data summary table)
All samples are classified into whole organism, tissues, cell lines and subcellular fractions. Especially, hundreds of exosome from blood fractions of control and eight diseased cohorts are included in our database for exploration.
Users can retrived the expression of circular RNAs (FPBcirc), their cognate linear RNA (FPBlinear) and the expression of circular RNAs relative to their cognate linear RNA (CIRCscore) in "Search" module. And we provied serveral filters to help users to find the samples of interest:
Note: user must specify the genome locus by providing (zero-based) genome locations or Refeq/Ensembl gene id/symbols. As a option, they can further search the specific circular RNA with recommanded name [eg. circCAMSAP1(6,7)] or circID (eg. CIRCHSA_CAMSAP1_2).
Besides, clicking the circID in results opens a dedicated page of comprehensive annotation for this circular RNA. And Clicking the Location redirects to JBrowser to visualize this circular RNA's position in the genome.
Users can analyze the numbers, length and expression of circular RNA between selected samples.
Users can predicte the interaction between circular RNAs and RNA binding proetins (RBP), mRNAs/lncRNAs or miRNAs.
We recently identified a DIS3 (chromosome disjunction protein 3, exosome endoribonuclease, and 3’–5’ exoribonuclease) mediated circRNA decay pathway which regulates the turnover of more than half of circRNAs by preferentially targeting U-rich motifs. Accroding to the reported method, we provides prediction of DIS3 degradation signal in circRNAs. Then, the sequences was optimized by base (for non-coding sequence) or synonymous codon (for coding sequence) replacements in signal-overlapping regions to minimize uridine (U) content to design more stable RNA cicles.
We also provided functions to predict the open reading frames (ORFs) and internal ribosome entry sites (IRESs) within circRNAs.
To predict potential m6A modification sites on circRNAs, circRNA sequences were first scanned for canonical m6A motifs (DRACH, where D = G/A/T, R = G/A, H = A/C/T), and then compared with experimentally validated m6A peaks deposited in the REPIC database to identify overlapped ones for confidence m6A sites.
To design divergent PCR primers for each circular RNA, we first parsed the circular RNA sequences to identify the BSJ sequences. Subsequently, we utilized Primer3 (v2.6.1) to generate divergent PCR primers that amplify products spanning the BSJ.
To design siRNAs targeting BSJs of circular RNAs, we parsed the circular RNA sequences to get BSJ sequences. Then EMBOSS sirna (v6.6.0) was used to generate multiple siRNA sequences that target the BSJ sequences, with scores for user selection.