Human RNA MaP

How to predict by coordinate

Input the genomic coordinates of your region of interest, using hg38 as the reference genome. This input is the most flexible, and is unconstrained by sequencing coverage or any annotation. Be sure to double-check that the output sequence matches your expected region, and check that the output coverage of A/C bases is above the recommended 70%, which is reflective of DMS-modifiable bases meeting all coverage and quality filters.

For most users, only one set of coordinates will be used to define their region of interest. However, two sets of coordinates may be needed when joining two separate regions together, such as when crossing a splice junction.

When analyzing a region within a larger transcript, it is generally recommended to test “buffer” regions, where the region of interest should be extended by 20-50 nt on each end in order to reduce the likelihood of structures being arbitrarily interrupted by the region borders.

Due to computational and server constraints, users will be limited to a maximum input length of 500 nucleotides and a maximum output of 5 predicted structures, though some regions may yield fewer than that. If this does not suit your needs, consider downloading and running the data and code locally (see Download ).

Predict by Gene

Advanced Options

How was this calculated?

THAP12
ENST00000260045 (-) ENSG00000137492 (-)

Custom Window:

Start

Size

100

150

300

500

Strand

+

-

Predict

	Region	Local Coords	Chrom Coords	Length	R	N	Gini	Action
1	5'UTR	103-190	76380943-76381030	88	0.1288	50	0.31	View
2	5'UTR,CDS	191-301	76380832-76380942	111	0.9425	50	0.66	View
3	CDS	303-391	76365967-76365972 / 76380748-76380830	89	0.7248	50	0.28	View
4	CDS	393-486	76365872-76365965	94	0.5515	50	0.18	View
5	CDS	487-578	76360992-76361063 / 76365852-76365871	92	0.7271	50	0.28	View
6	CDS	579-650	76355619-76355654 / 76360956-76360991	72	0.7212	50	0.23	View
7	CDS	651-722	76352724-76352794 / 76355618-76355618	72	0.7490	50	0.29	View
8	CDS	724-810	76352636-76352722	87	0.8050	50	0.31	View
9	CDS	812-906	76352540-76352634	95	0.8881	50	0.36	View
10	CDS	909-1015	76352431-76352537	107	0.7839	50	0.27	View
11	CDS	1017-1119	76352327-76352429	103	0.8235	50	0.26	View
12	CDS	1120-1231	76352215-76352326	112	0.8546	50	0.27	View
13	CDS	1233-1351	76352095-76352213	119	0.7376	50	0.22	View
14	CDS	1352-1462	76351984-76352094	111	0.8107	50	0.25	View
15	CDS	1469-1559	76351887-76351977	91	0.8514	50	0.31	View
16	CDS	1560-1670	76351776-76351886	111	0.7809	50	0.26	View
17	CDS	1674-1790	76351656-76351772	117	0.7229	50	0.27	View
18	CDS	1791-1891	76351555-76351655	101	0.6797	50	0.29	View
19	CDS	1894-1984	76351462-76351552	91	0.8825	50	0.38	View
20	CDS	1985-2071	76351375-76351461	87	0.8277	50	0.32	View
21	CDS	2073-2158	76351288-76351373	86	0.7081	50	0.24	View
22	CDS	2161-2249	76351197-76351285	89	0.7444	50	0.28	View
23	CDS	2250-2352	76351094-76351196	103	0.8546	50	0.32	View
24	CDS	2355-2453	76350993-76351091	99	0.7828	50	0.28	View
25	CDS	2458-2550	76350896-76350988	93	0.6580	50	0.23	View
26	CDS,3'UTR	2551-2650	76350796-76350895	100	0.5774	50	0.23	View
27	3'UTR	2651-2744	76350702-76350795	94	0.7434	50	0.23	View
28	3'UTR	2745-2860	76350586-76350701	116	0.8218	50	0.30	View
29	3'UTR	2862-2983	76350463-76350584	122	0.8306	50	0.28	View
30	3'UTR	2987-3078	76350368-76350459	92	0.7620	50	0.29	View
31	3'UTR	3079-3193	76350253-76350367	115	0.8722	50	0.39	View
32	3'UTR	3195-3316	76350130-76350251	122	0.8886	50	0.39	View
33	3'UTR	3317-3409	76350037-76350129	93	0.7086	50	0.33	View
34	3'UTR	3410-3473	76349973-76350036	64	0.3226	28	0.33	View
Download CSV

Custom Gene Windows

Gene Window Instructions/Interpretation

Search for a gene by name. For each gene, only the canonical transcript (as defined by UCSC in GRCh38.106) is shown. Some gene names may correspond to multiple genomic locations.
For the selected gene, a scatterplot showing the Gini indices, where a high Gini index corresponds to a highly-structured region, for small windows within the transcript. For small RNAs, these windows are sized 20 valid (coverage- and quality-filtered) A/C data points; for all other transcripts, these windows are sized 50 valid A/C data points, due to the ability of DMS to modify primarily A/C bases. In regions with sufficient coverage, these windows correspond to actual transcript lengths of roughly 40 and 100 nucleotides, respectively.
Select a window of interest, either from the scatter plot or from the table below. Generally, windows with a high Gini will yield better results, where the DMS signal aligns more accurately with the predicted base-pais. Windows with high Gini indices relative to the rest of the transcript can also be used to find functional structural elements (ex: TFRC iron response elements).
If previously-defined windows do not suit your needs, you can define a custom window either using (1) the slider below the scatterplot, which shows both a heatmap of previously-defined windows as well as the locations of each UTR and CDS; or (2) custom coordinate-based entry via Predict by Coordinates.

See About for more information on the dataset and for best practices in structure determination.

completed

6
THAP12

Aoi:	Chr:	Coords	Strand:
THAP12	11	76355619 - 76355654, 76360956 - 76360991	-

Download Data & Images

Annotation Based on canonical annotations, the following gene is in your area of interest: THAP12(-)
Length Length of region: 72 nt.
Coverage 100.0% of the region's A/C bases included in the filtered DMS dataset. Ideally, this number should be as high as possible for an experimenally-accurate prediction, and over 70% is recommended.
Structures This region has 1 maximum predicted structures. 5 is the maximum structures visible on this site. For up to 20 predictions per region, download and run the code locally.

Predict by Gene

THAP12 ENST00000260045 (-) ENSG00000137492 (-)

Custom Gene Windows

completed 6 THAP12

THAP12
ENST00000260045 (-) ENSG00000137492 (-)

completed

6
THAP12