Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*Unverified author*

R Software Module

rwasp_regression_trees.wasp

Title produced by software

Recursive Partitioning (Regression Trees)

Date of computation

Wed, 26 May 2010 11:26:45 +0000

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2010/May/26/t1274873276vj2ucpnjh4tu3fn.htm/, Retrieved Thu, 03 Jul 2025 09:05:05 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=76458, Retrieved Thu, 03 Jul 2025 09:05:05 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

B521,regression tree,steven,coomans,thesis,per2maand

Estimated Impact

216

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

-       [Recursive Partitioning (Regression Trees)] [B521,regression t...] [2010-05-26 11:26:45] [d41d8cd98f00b204e9800998ecf8427e] [Current]

Feedback Forum

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

341.25	NA	333.775928269088	340.90875028286	381,75
303.6875	341.25	338.492504687478	332.86089603337	417,75
357.5	337.49375	316.528510995532	325.788548906474	359,25
295.075	339.494375	342.383916967292	327.541311770725	336
386.5755	335.0524375	312.529223276755	332.035063343418	361,8
455.6625	340.20474375	359.256754833345	355.979142780996	420,275
424.926	351.750519375	420.094418776242	422.853279739822	400,9
506.751	359.0680674375	423.143429126151	447.538297256219	248,15
433.9	373.83636069375	475.904696528962	463.452634199947	480,125
466.3375	379.842724624375	449.397276177252	469.038817658043	441,4
496.7	388.492202161938	460.087548237949	454.838864911353	483
464.45	399.312981945744	483.19214675098	483.073307566238	477,75
385.375	405.826683751169	471.364755928644	464.126997390433	486,25
381.875	403.781515376053	417.100190422978	429.475306145968	433,875
219.6375	401.590863838447	394.871034955396	366.780195970248	286,4
268.975	383.395527454603	284.288425042335	314.305881579131	231,875
292.2875	371.953474709142	274.624758163045	216.573154866353	308,65
181.025	363.986877238228	285.770981350566	228.421842054758	253,025
277.625	345.690689514405	219.670140557742	233.11741103145	201,025
166.75	338.884120562965	256.243046627923	200.091184304044	189,775
266	321.670708506668	199.767699784603	235.499484918374	237,375
189.25	316.103637656002	241.56415512763	219.691637200487	171,125
226.35	303.418273890401	208.550863403060	204.478173107192	212,4
158.75	295.711446501361	219.783159661820	216.921637949675	143,1
218.8125	282.015301851225	181.267666104251	235.210210182817	205,125

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'Sir Ronald Aylmer Fisher' @ 193.190.124.24
R Framework error message	Warning: there are blank lines in the 'Data X' field. Please, use NA for missing data - blank lines are simply deleted and are NOT treated as missing values.

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 3 seconds \tabularnewline
R Server & 'Sir Ronald Aylmer Fisher' @ 193.190.124.24 \tabularnewline
R Framework error message & Warning: there are blank lines in the 'Data X' field.
Please, use NA for missing data - blank lines are simply
 deleted and are NOT treated as missing values. \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=76458&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]3 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'Sir Ronald Aylmer Fisher' @ 193.190.124.24[/C][/ROW]
[ROW][C]R Framework error message[/C][C]Warning: there are blank lines in the 'Data X' field.
Please, use NA for missing data - blank lines are simply
 deleted and are NOT treated as missing values.[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=76458&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=76458&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	3 seconds
R Server	'Sir Ronald Aylmer Fisher' @ 193.190.124.24
R Framework error message	Warning: there are blank lines in the 'Data X' field. Please, use NA for missing data - blank lines are simply deleted and are NOT treated as missing values.

Model Performance
#	Complexity	split	relative error	CV error	CV S.D.
1	0.603	0	1	1.13	0.2
2	0.01	1	0.397	1.01	0.285

\begin{tabular}{lllllllll}
\hline
Model Performance \tabularnewline
# & Complexity & split & relative error & CV error & CV S.D. \tabularnewline
1 & 0.603 & 0 & 1 & 1.13 & 0.2 \tabularnewline
2 & 0.01 & 1 & 0.397 & 1.01 & 0.285 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=76458&T=1

[TABLE]
[ROW][C]Model Performance[/C][/ROW]
[ROW][C]#[/C][C]Complexity[/C][C]split[/C][C]relative error[/C][C]CV error[/C][C]CV S.D.[/C][/ROW]
[ROW][C]1[/C][C]0.603[/C][C]0[/C][C]1[/C][C]1.13[/C][C]0.2[/C][/ROW]
[ROW][C]2[/C][C]0.01[/C][C]1[/C][C]0.397[/C][C]1.01[/C][C]0.285[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=76458&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=76458&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Model Performance
#	Complexity	split	relative error	CV error	CV S.D.
1	0.603	0	1	1.13	0.2
2	0.01	1	0.397	1.01	0.285

Figure 1

PNG link

Postscript link

PDF link

Figure 2

PNG link

Postscript link

PDF link

Figure 3

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 1 ; par2 = No ;

Parameters (R input):

par1 = 1 ; par2 = No ;

R code (references can be found in the software module):

library(rpart)
library(partykit)
par1 <- as.numeric(par1)
autoprune <- function ( tree, method='Minimum CV'){
xerr <- tree$cptable[,'xerror']
cpmin.id <- which.min(xerr)
if (method == 'Minimum CV Error plus 1 SD'){
xstd <- tree$cptable[,'xstd']
errt <- xerr[cpmin.id] + xstd[cpmin.id]
cpSE1.min <- which.min( errt < xerr )
mycp <- (tree$cptable[,'CP'])[cpSE1.min]
}
if (method == 'Minimum CV') {
mycp <- (tree$cptable[,'CP'])[cpmin.id]
}
return (mycp)
}
conf.multi.mat <- function(true, new)
{
if ( all( is.na(match( levels(true),levels(new) ) )) )
stop ( 'conflict of vector levels')
multi.t <- list()
for (mylev in levels(true) ) {
true.tmp <- true
new.tmp <- new
left.lev <- levels (true.tmp)[- match(mylev,levels(true) ) ]
levels(true.tmp) <- list ( mylev = mylev, all = left.lev )
levels(new.tmp)  <- list ( mylev = mylev, all = left.lev )
curr.t <- conf.mat ( true.tmp , new.tmp )
multi.t[[mylev]] <- curr.t
multi.t[[mylev]]$precision <-
round( curr.t$conf[1,1] / sum( curr.t$conf[1,] ), 2 )
}
return (multi.t)
}
x <- t(y)
k <- length(x[1,])
n <- length(x[,1])
x1 <- cbind(x[,par1], x[,1:k!=par1])
mycolnames <- c(colnames(x)[par1], colnames(x)[1:k!=par1])
colnames(x1) <- mycolnames #colnames(x)[par1]
m <- rpart(as.data.frame(x1))
par2
if (par2 != 'No') {
mincp <- autoprune(m,method=par2)
print(mincp)
m <- prune(m,cp=mincp)
}
m$cptable
bitmap(file='test1.png')
plot(as.party(m),tp_args=list(id=FALSE))
dev.off()
bitmap(file='test2.png')
plotcp(m)
dev.off()
cbind(y=m$y,pred=predict(m),res=residuals(m))
myr <- residuals(m)
myp <- predict(m)
bitmap(file='test4.png')
op <- par(mfrow=c(2,2))
plot(myr,ylab='residuals')
plot(density(myr),main='Residual Kernel Density')
plot(myp,myr,xlab='predicted',ylab='residuals',main='Predicted vs Residuals')
plot(density(myp),main='Prediction Kernel Density')
par(op)
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Model Performance',6,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'#',header=TRUE)
a<-table.element(a,'Complexity',header=TRUE)
a<-table.element(a,'split',header=TRUE)
a<-table.element(a,'relative error',header=TRUE)
a<-table.element(a,'CV error',header=TRUE)
a<-table.element(a,'CV S.D.',header=TRUE)
a<-table.row.end(a)
for (i in 1:length(m$cptable[,1])) {
a<-table.row.start(a)
a<-table.element(a,i,header=TRUE)
a<-table.element(a,round(m$cptable[i,'CP'],3))
a<-table.element(a,m$cptable[i,'nsplit'])
a<-table.element(a,round(m$cptable[i,'rel error'],3))
a<-table.element(a,round(m$cptable[i,'xerror'],3))
a<-table.element(a,round(m$cptable[i,'xstd'],3))
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code