Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*The author of this computation has been verified*

R Software Module

rwasp_boxcoxnorm.wasp

Title produced by software

Box-Cox Normality Plot

Date of computation

Wed, 12 Nov 2008 05:32:59 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/12/t12264932480uwzlyq824o1gwp.htm/, Retrieved Fri, 04 Jul 2025 09:57:25 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=24154, Retrieved Fri, 04 Jul 2025 09:57:25 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Estimated Impact

261

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

F       [Box-Cox Normality Plot] [Box-cox normality...] [2008-11-12 12:32:59] [a16dfd7e948381d8b6391003c5d09447] [Current]
F    D    [Box-Cox Normality Plot] [opdracht3_Q4] [2008-11-12 15:45:57] [3f66c6f083b1153972739491b89fa2dd] 

Feedback Forum

2008-11-16 17:37:08 [006ad2c49b6a7c2ad6ab685cfc1dae56] [reply] 
Het doel van deze box-cox normality is om de data te transformeren zodat ze normaler verdeeld worden. Net zoals bij de box-cox linearity plot wordt er een maximum bereikt bij de optimale lambda-waarde voor transformatie. In dit geval is de grafiek maximaal bij -2 maar dit kan je niet als een maximum beschouwen omdat de grafiek voor lambda=-2 duidelijk nog stijgt. Om het maximum te kennen moet je dus een andere grafieken maken met kleinere lambda-waarden. De transformatie heeft een zeer sterk effect. De grafiek is helemaal veranderd.
2008-11-20 12:42:58 [Carole Thielens] [reply] 
De interpretatie van de studente met betrekking tot deze vraag, is helemaal correct. Wat uitleg over de grafieken zelf, is hier opnieuw welkom. 
Een Box-cox transformatie kan leiden tot een meer uitgesproken normaalverdeling. Zo kunnen statistische technieken toegepast worden op de gegevens die gebaseerd zijn op de veronderstelling dat er sprake is van een normaalverdeling. Op de verticale as van de Box-cox normality plot wordt de correlatiecoëfficiënt na transformatie weergegeven, terwijl lambda op de x-as staat.  
Wanneer de histogrammen van de originele en de getransformeerde data met elkaar vergeleken worden, kan besloten worden dat deze transformatie helemaal niet leidde tot een normaalverdeling. Integendeel, wat oorspronkelijk een normaalverdeling benaderde, veranderde in een rechtsscheve verdeling. Dit wordt ook duidelijk op de normal Q-Q plot, waar de punten veel verder van de rechte komen te liggen na de transformatie. 
2008-11-22 10:41:17 [Stephanie Vanderlinden] [reply] 
De conclusie is correct, de uitleg over de grafieken ontbreekt. We gebruiken de box-cox transformatie om tot een meer uitgesproken normaalverdeling te komen. Hier bekomen we echter geen betere normaalverdeling. Als de transformatie is toegepast, bekomen we een rechtsscheve verdeling. De originele grafiek benaderde de normaalverdeling veel beter. We kunnen dus zeggen dat de transformatie een groot effect heeft op de gegevens aangezien de grafiek nu vertekend is.

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

6,6
6,4
6
5,8
5,5
5,4
5,4
5,6
5,5
5,8
5,8
5,7
5,5
5,3
5,2
5,3
5,3
5
4,8
4,9
5,3
6
6,2
6,4
6,4
6,4
6,2
6,1
6
5,9
6,2
6,2
6,4
6,8
6,9
7
7
6,9
6,7
6,6
6,5
6,4
6,5
6,5
6,6
6,7
6,8
7,2
7,6
7,6
7,3
6,4
6,1
6,3
7,1
7,5
7,4
7,1
6,8
6,9
7,2
7,4
7,3
6,9
6,9
6,8
7,1
7,2
7,1
7
6,9
7
7,4
7,5
7,5
7,4
7,3
7
6,7
6,5
6,5
6,5
6,6
6,8
6,9
6,9
6,8
6,8
6,5
6,1
6
5,9
5,8
5,9
5,9
6,2
6,3
6,2
6
5,8
5,5
5,5
5,7
5,8
5,7

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'Sir Ronald Aylmer Fisher' @ 193.190.124.24

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 3 seconds \tabularnewline
R Server & 'Sir Ronald Aylmer Fisher' @ 193.190.124.24 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24154&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]3 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'Sir Ronald Aylmer Fisher' @ 193.190.124.24[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24154&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24154&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'Sir Ronald Aylmer Fisher' @ 193.190.124.24

Box-Cox Normality Plot
# observations x	105
maximum correlation	0.267595739045043
optimal lambda	-2

\begin{tabular}{lllllllll}
\hline
Box-Cox Normality Plot \tabularnewline
# observations x & 105 \tabularnewline
maximum correlation & 0.267595739045043 \tabularnewline
optimal lambda & -2 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24154&T=1

[TABLE]
[ROW][C]Box-Cox Normality Plot[/C][/ROW]
[ROW][C]# observations x[/C][C]105[/C][/ROW]
[ROW][C]maximum correlation[/C][C]0.267595739045043[/C][/ROW]
[ROW][C]optimal lambda[/C][C]-2[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24154&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24154&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Box-Cox Normality Plot
# observations x	105
maximum correlation	0.267595739045043
optimal lambda	-2

Figure 1

PNG link

Postscript link

PDF link

Figure 2

PNG link

Postscript link

PDF link

Figure 3

PNG link

Postscript link

PDF link

Figure 4

PNG link

Postscript link

PDF link

Figure 5

PNG link

Postscript link

PDF link

Parameters (Session):

Parameters (R input):

R code (references can be found in the software module):

n <- length(x)
c <- array(NA,dim=c(401))
l <- array(NA,dim=c(401))
mx <- 0
mxli <- -999
for (i in 1:401)
{
l[i] <- (i-201)/100
if (l[i] != 0)
{
x1 <- (x^l[i] - 1) / l[i]
} else {
x1 <- log(x)
}
c[i] <- cor(qnorm(ppoints(x), mean=0, sd=1),x1)
if (mx < c[i])
{
mx <- c[i]
mxli <- l[i]
}
}
c
mx
mxli
if (mxli != 0)
{
x1 <- (x^mxli - 1) / mxli
} else {
x1 <- log(x)
}
bitmap(file='test1.png')
plot(l,c,main='Box-Cox Normality Plot',xlab='Lambda',ylab='correlation')
mtext(paste('Optimal Lambda =',mxli))
grid()
dev.off()
bitmap(file='test2.png')
hist(x,main='Histogram of Original Data',xlab='X',ylab='frequency')
grid()
dev.off()
bitmap(file='test3.png')
hist(x1,main='Histogram of Transformed Data',xlab='X',ylab='frequency')
grid()
dev.off()
bitmap(file='test4.png')
qqnorm(x)
qqline(x)
grid()
mtext('Original Data')
dev.off()
bitmap(file='test5.png')
qqnorm(x1)
qqline(x1)
grid()
mtext('Transformed Data')
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Box-Cox Normality Plot',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'# observations x',header=TRUE)
a<-table.element(a,n)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'maximum correlation',header=TRUE)
a<-table.element(a,mx)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'optimal lambda',header=TRUE)
a<-table.element(a,mxli)
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code