How RMINC projects and visualizes results using a sample cross-sectional comparison.
RMINC
Author
Linh Pham and Jason Lerch
Published
July 30, 2025
Modified
July 30, 2025
RMINC visualization tools are versatile, but how the results are visualized is not super obvious. Here, I'm demonstrating a little of what's happening under the hood using Four Core Genotype rat data that Myrto Lavda had gathered.
Myrto had surgically induced strokes in some of these animals to understand how sex hormones and chromosomes affect stroke outcomes in the brain. We will be comparing differences in volume between rats of different genotypes and at different stages of the stroke experiment.
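The chunk that loads the packages didn't survive this export. Below is a minimal reconstruction inferred from the startup messages that follow; the RMINC call is an assumption, since mincLm, mincFDR, mincArray, and mincGetVolume are used later.

library(tidyverse)
library(RMINC)      # assumed: provides mincLm, mincFDR, mincArray, mincGetVolume
library(MRIcrotome) # sliceSeries/anatomy/overlay plotting
library(gridExtra)
library(emmeans)
library(rstatix)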
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr 1.1.4 ✔ readr 2.1.5
✔ forcats 1.0.0 ✔ stringr 1.5.1
✔ ggplot2 3.5.1 ✔ tibble 3.3.0
✔ lubridate 1.9.4 ✔ tidyr 1.3.1
✔ purrr 1.0.4
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag() masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
Attaching package: 'MRIcrotome'
The following object is masked from 'package:graphics':
legend
Attaching package: 'gridExtra'
The following object is masked from 'package:dplyr':
combine
Welcome to emmeans.
Caution: You lose important information if you filter this package's results.
See '? untidy'
Attaching package: 'rstatix'
The following object is masked from 'package:stats':
filter
Reading in the data
The output of a PydPiper pipeline includes different types of Jacobian determinants for each image. The full Jacobian (log_full_det) is used when one is interested in volumetric effects without accounting for total brain volume (TBV). The nonlinear Jacobian (log_nlin_det) is used when TBV should be accounted for.
PydPiper saves these determinants after different levels of blurring have been applied at each voxel. An FWHM blur of 0.2 is usually used when the analyses are at the voxel level. An FWHM of 0 (no blurring) is used when analyses are done at the regional level, since noise in the voxel-wise determinants is reduced when they are averaged/combined to make a regional volume.
The code below reads in a csv, produced by the PydPiper pipeline, that indicates where the determinant files for each image/animal are located. Files with FWHM 0.2 are chosen because analyses will be done at the voxel level. Some text manipulations are applied to the animal ID and time point for ease of analysis later on. A new variable, 'stage', is made to record whether the animal was scanned at baseline, after acute stroke, or during recovery, based on the file names.
pipeout <- read_csv("/well/lerch/users/dqp841/FCG_Rat_Stroke/pipelines/analysis.csv") %>%
  dplyr::filter(fwhm == 0.2) %>%
  mutate(
    Animal.ID = basename(log_nlin_det),
    # the following lines are modifying animal IDs and category to make them
    # suitable for subsequent analyses
    Animal.ID = str_replace(Animal.ID, "RNC4", "RNEC4"),
    Animal.ID = str_replace(Animal.ID, "RNEC4_1_Base", "RNEC4_1c_Base"),
    Animal.ID = str_replace(Animal.ID, "RNEC1_1b", "RNEC3_1b"),
    Animal.ID = gsub('.+Stroke_(.+_.+)_[PBF].+', '\\1', Animal.ID),
    timepoint = str_split(Animal.ID, "_") %>%
      map_chr(., ~ ifelse(length(.x) > 2, .x[3], "PS3")),
    timepoint = ifelse(timepoint == "a", "Baseline", timepoint),
    timepoint = ifelse(timepoint == "b", "Baseline", timepoint),
    Animal.ID = str_split(Animal.ID, "_") %>%
      map_chr(., ~ paste(.x[1], .x[2], sep = ".")),
    stage = case_when(
      str_detect(log_nlin_det, "Baseline") ~ "baseline",
      str_detect(log_nlin_det, "baseline") ~ "baseline",
      str_detect(log_nlin_det, "PS37") ~ "recovery",
      str_detect(log_nlin_det, "Recovery") ~ "recovery",
      str_detect(log_nlin_det, "PS30") ~ "recovery",
      str_detect(log_nlin_det, "PS3\\d") ~ "recovery",
      str_detect(log_nlin_det, "PS3") ~ "acute",
      str_detect(log_nlin_det, "RNEC3_1a_FCGStro") ~ "acute",
      str_detect(log_nlin_det, "RNEC1_1b") ~ "acute",
      TRUE ~ "undetected"
    )
  )
Rows: 216 Columns: 12
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (11): rigid_xfm, lsq12_nlin_xfm, overall_xfm, native_file, lsq6_file, ls...
dbl (1): fwhm
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
# reading in the data and keeping only IDs that are unique
# (removing duplicates, essentially)
ratinfo <- read_csv("/well/lerch/users/dqp841/FCG_Rat_Stroke/analyses/Master_Data_File - info-all_clean.csv") %>%
  distinct(`Animal ID`, .keep_all = TRUE) %>%
  rename(Animal.ID = `Animal ID`)
Rows: 36 Columns: 10
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (3): Animal ID, Sex, Genotype
dbl (7): #, Age (Baseline), Age (Acute), Age (Recovery), Weight (Baseline), ...
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
# now we need to join the pipeout data frame with the rat info data frame by
# animal ID. first, we need to see if the IDs match up across pipeout and ratinfo.
head(pipeout$Animal.ID)
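The join itself wasn't preserved in this export. A minimal sketch of that step, assuming a left join on the now-matching IDs produces the gf data frame used in the models below:

# assumed join step: combine scan-level pipeline output with animal metadata
gf <- pipeout %>% left_join(ratinfo, by = "Animal.ID")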
Reading in the consensus/averaged segmented image generated after nonlinear transformation round 3 (nlin-3). This will be used downstream as a mask to indicate which voxels in the images can be used to make linear models.
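The read itself isn't shown in this export. A sketch with a placeholder path; the mask is kept as a file path because both mincLm (below) and mincGetVolume (later) accept one:

# placeholder path; the actual pipeline output location wasn't preserved
mask <- "path/to/pipeline/nlin-3/nlin-3_voted.mnc"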
In the following examples, different types of models are fitted to answer some very specific questions.
Simple case: vs_genotype model
Here, the mincLm function from the RMINC package is used. This function allows you to fit a linear model across all voxels in the data.
The model is addressing this question: which voxels show statistically significant volumetric differences between the genotypes at the baseline time point?
Notice the mask imported earlier is used here to help mincLm determine which voxels in the images can be included in the linear modeling.
# note that this is using the log_full_det determinants. total brain volume is
# not accounted for in this model. to do that, you'd use the relative
# (log_nlin_det) determinants.
vs_genotype <- mincLm(log_full_det ~ genotype,
                      gf %>% dplyr::filter(stage == "baseline"),
                      mask = mask)
So the results from mincLm seem to include the beta coefficients (differences) in log_full_det between XX and the other genotypes at baseline (XX is not present, so it's serving as the reference level for all comparisons), and the t-statistics associated with each of those differences.
If you want to see whether XY is different from the other genotypes, for example, you would need to re-level the genotype factor and refit. Sample code below (this code is not run).
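The promised sample didn't survive this export; here is a minimal sketch of one way to do it, using relevel (an assumption about the original approach, and not run):

# not run: make XY the reference level, then refit the voxel-wise model
gf_xy <- gf %>% mutate(genotype = relevel(factor(genotype), ref = "XY"))
vs_genotype_xy <- mincLm(log_full_det ~ genotype,
                         gf_xy %>% dplyr::filter(stage == "baseline"),
                         mask = mask)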
There are 746,045 voxels where these statistics are available. That's too many comparisons, and we run the risk of deeming voxels significant that are not actually different between the groups (false positives). Multiple comparison adjustments need to be done.
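That count can presumably be read off the object itself, since (as noted below) mincLm results behave like a matrix with one row per voxel; e.g.:

nrow(vs_genotype)  # assumed to return the number of voxels under the mask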
This is where mincFDR comes in. mincFDR takes a mincLm object as an input, and then (I think) it: 1) converts the t-values in mincLm to p-values and 2) applies the p.adjust function to all voxels, taking into account how many voxels there are in total.
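A toy sketch of that hypothesized two-step logic for a single column of t-values; this is an illustration, not RMINC's actual internals, and df would be the model's residual degrees of freedom:

# hypothetical helper: two-sided p-values from t-values, then BH/FDR adjustment
t_to_q <- function(tvals, df) {
  pvals <- 2 * pt(abs(tvals), df = df, lower.tail = FALSE)
  p.adjust(pvals, method = "fdr")
}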
Let’s apply mincFDR to our simple model and see its outputs. I’m calling it vs_genotype_qvals because after FDR adjustments, p-values are called q-values.
vs_genotype_qvals <- mincFDR(vs_genotype)
Computing FDR threshold for all columns
Computing threshold for F-statistic
Computing threshold for tvalue-(Intercept)
Computing threshold for tvalue-genotypeXY
Computing threshold for tvalue-genotypeXYSry
Computing threshold for tvalue-genotypeXXSry
So you get a matrix with the same number of voxels (rows) as your input model and some columns labeled qvalue-tvalue-genotype. I suspect that these are going to be the adjusted p-values (or q-values) you'll want to visualize on your atlases!
Let’s just double check that’s the case. I’m checking the range of these q-value columns. If they’re all between 0 and 1, that would mean they are in fact adjusted p-values.
# apply the function (range) to all columns in vs_genotype_qvals
sapply(1:ncol(vs_genotype_qvals), function(x) range(vs_genotype_qvals[, x]))
Nice! They're all between 0 and 1, as adjusted p-values should be, and there is a way to visualize these on your atlases.
Visualizations
To do our visualizations, we need a couple more things. First, a template image so the statistics visualizations can be laid on top of it. Here we’re using the average nlin-3 image from the pipeline.
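The read isn't shown in this export. A sketch with a placeholder path, mirroring the labelVol read below:

# placeholder path: average nlin-3 image from the pipeline
nlin <- mincArray(mincGetVolume("path/to/pipeline/nlin-3/nlin-3.mnc"))
dim(nlin)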
# using the 'mask' file again because it's actually the labelled atlas
labelVol <- mincArray(mincGetVolume(mask))
dim(labelVol)
[1] 85 131 67
Notice when I print the dimensions of nlin and labelVol, they are exactly the same. That’s because nlin is the nlin-3 image, and labelVol is the labels of the nlin-3 image (nlin-3_voted).
Both files have 85 slices in the x-axis, 131 slices in the y-axis, and 67 slices in the z-axis.
Let’s look at an example slice from these data matrices to see what information they’re storing.
# pull a sample of the matrix representing voxels in slice 30 of the z-axis
# in nlin and labelVol
nlin[, , 30][35, ]
labelVol[, , 30][35, ]
I've pulled out 131 values each from nlin and labelVol. Each of these values is the information contained in one voxel of that subset at slice 30 in the z-axis.
In nlin, the values are decimal numbers representing image intensity.
In labelVol, the values are whole numbers representing an atlas label (e.g. 1 is hippocampus; I'm just making up the numbers and their associations, the actual mapping is likely different).
With that in mind, I’m gonna annotate the following visualization:
# start a sliceSeries, with 1 row and 5 columns. use slices from the z-axis
# (default), and plot 5 slices (5 columns) from slices 30 through 90.
sliceSeries(nrow = 1, ncol = 5, begin = 30, end = 90) %>%
  # overlay the anatomy image, in our case the nlin-3 image. plot voxels that
  # are between 700 and 1400 in image intensity.
  anatomy(nlin, low = 700, high = 1400) %>%
  # overlay the statistics. see the explanation below the generated figure.
  overlay(-log10(mincArray(vs_genotype_qvals, "qvalue-tvalue-genotypeXXSry")) *
            (labelVol > 0.5) *
            sign(mincArray(vs_genotype, "tvalue-genotypeXXSry")),
          low = 0.1, high = 2, symmetric = T) %>%
  # add a title
  addtitle("XXSry abs") %>%
  # add a legend
  legend("-log10 q-value") %>%
  # draw the figure!
  draw()
The statistics overlay
As a reminder, this was the statistics overlay call in the visualization above:
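overlay(-log10(mincArray(vs_genotype_qvals, "qvalue-tvalue-genotypeXXSry")) *
          (labelVol > 0.5) *
          sign(mincArray(vs_genotype, "tvalue-genotypeXXSry")),
        low = 0.1, high = 2, symmetric = T)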
To explain how this works, let's break down how it's calculated first. I'm still pulling a sample from slice 30 of the z-dimension to be consistent.
# step 1: calculate -log10 of the q-values at every voxel. log10 is multiplied
# by negative 1 because log10 gives negative values for anything less than 1,
# and negative q-values don't make sense. applying -log10 to the q-values just
# scales them up to make visualization easier (seeing the difference between
# 0 and 2 is easier than between 0 and 1).
log10_q <- -log10(mincArray(vs_genotype_qvals, "qvalue-tvalue-genotypeXXSry"))[, , 30][35, ]
log10_q
# step 2: identify whether a voxel is actually inside a region of
# interest/inside the brain. in R, TRUE is coded as 1 and FALSE as 0, so a
# matrix of TRUE/FALSE also reads as a 1/0 matrix. a voxel inside the
# brain/associated with a brain region has an atlas label > 0, which is why
# this works.
label_in_brain <- labelVol[, , 30][35, ] > 0.5
label_in_brain
# step 3: identify the signs of the t-values in the voxels. negative means
# lesser volume in the XXSry genotype compared to XX; positive means greater
# volume in XXSry compared to XX.
tvals_sign <- sign(mincArray(vs_genotype, "tvalue-genotypeXXSry")[, , 30][35, ])
tvals_sign
# notice these vectors have the same length; we're referring to the same
# voxels across all of them.
length(log10_q)
[1] 131
length(label_in_brain)
[1] 131
length(tvals_sign)
[1] 131
# step 4: multiply all three vectors together. these are the values plotted
# in your visualization.
sample_qvals_on_map <- log10_q * label_in_brain * tvals_sign
sample_qvals_on_map
In this case, they range from 0 to 0.91. Notice in the code we had this bit `low=0.1, high=2, symmetric=T`. Here, any voxel whose -log10 q-value is below 0.1 is not plotted, anything at 2 or beyond is shown at the top of the color scale (a -log10 q-value of 2 corresponds to a raw q-value of 0.01), and symmetric=T means negative values (volume decreases) are drawn on a mirrored color scale. If I increase the low threshold to 0.7, only voxels with -log10 q-values above 0.7 (roughly q < 0.2) survive, so the display gets sparser.
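The figure for that variant didn't survive this export; rerunning the same sliceSeries call with low = 0.7 would reproduce it:

sliceSeries(nrow = 1, ncol = 5, begin = 30, end = 90) %>%
  anatomy(nlin, low = 700, high = 1400) %>%
  overlay(-log10(mincArray(vs_genotype_qvals, "qvalue-tvalue-genotypeXXSry")) *
            (labelVol > 0.5) *
            sign(mincArray(vs_genotype, "tvalue-genotypeXXSry")),
          low = 0.7, high = 2, symmetric = T) %>%
  addtitle("XXSry abs, low = 0.7") %>%
  legend("-log10 q-value") %>%
  draw()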
Why do we want all of these modifications to the original q-values? If you play around with the code and take out `(labelVol > 0.5) * sign(mincArray(vs_genotype, "tvalue-genotypeXXSry"))`, your display will look exactly the same for the slices you've chosen.
But if you choose different slices (say, instead of 30 - 90, somewhere closer to the first or final slices, i.e. 110 - 131), where the data are noisier, I suspect you'll get a less accurate display of your stats. Just play around and see if this is true!
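One way to run that experiment, with the masking and sign terms stripped out (a sketch; adjust the slice range to taste):

sliceSeries(nrow = 1, ncol = 5, begin = 110, end = 131) %>%
  anatomy(nlin, low = 700, high = 1400) %>%
  overlay(-log10(mincArray(vs_genotype_qvals, "qvalue-tvalue-genotypeXXSry")),
          low = 0.1, high = 2, symmetric = T) %>%
  addtitle("unmasked, unsigned") %>%
  legend("-log10 q-value") %>%
  draw()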
I hope this will help you have more flexibility in using RMINC to visualize your results.