Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*Unverified author*

R Software Module

rwasp_regression_trees.wasp

Title produced by software

Recursive Partitioning (Regression Trees)

Date of computation

Wed, 26 May 2010 11:30:17 +0000

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2010/May/26/t1274873454fa9ly4x60c59rvd.htm/, Retrieved Fri, 03 May 2024 06:56:47 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=76459, Retrieved Fri, 03 May 2024 06:56:47 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

B580,regression tree,steven,coomans,thesis,per2maand

Estimated Impact

142

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

-       [Recursive Partitioning (Regression Trees)] [B580,regression t...] [2010-05-26 11:30:17] [d41d8cd98f00b204e9800998ecf8427e] [Current]

Feedback Forum

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

192	NA	201.992383256927	206.967782536146	212
212.25	192	197.374595777152	201.340376307467	219,9
191.8	194.025	204.248977354664	210.239313211865	225,9
163.7625	193.8025	198.495922223946	201.252485572609	204,05
272.025	190.7985	182.444540071016	188.931303179668	256,65
284.575	198.92115	223.842424405143	236.507656592625	260,375
301.6635	207.486535	251.908814573073	242.02280020498	221,75
287.5375	216.9042315	274.901984222098	249.532404318106	123,05
220.4375	223.96755835	280.741244492589	243.324681715068	277,375
178.3	223.614552515	252.873030359752	213.837340170125	244,4
284.8875	219.0830972635	218.410540574304	195.319860969677	223,8
283.9875	225.66353753715	249.131587099083	242.160129478196	310,9
238	231.495933783435	265.239575959338	241.764621171334	254,4
216.275	232.146340405092	252.651330534283	221.555245324865	254,625
162.875	230.559206364582	235.840709946355	212.008114250888	122,65
185.95	223.790785728124	202.121012334719	188.541288043735	117,775
193.7875	220.006707155312	194.647890420306	198.681681578005	187,15
128.3275	217.384786439781	194.250277557744	202.125899750263	195,65
83.925	208.479057795803	163.7853355103	173.359262231158	115,7
177.15	196.023652016222	126.879419477262	153.846420458440	144,375
142.3	194.1362868146	150.111000109239	194.814489244242	125,75
120.5375	188.95265813314	146.501296840105	179.499528695192	115,65
269.25	182.111142319826	134.502628164888	169.935918108429	193
167.625	190.825028087843	196.773530970922	235.288172646467	184,35
243.275	188.505025279059	183.303098751657	190.628692996618	194,325

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	4 seconds
R Server	'George Udny Yule' @ 72.249.76.132
R Framework error message	Warning: there are blank lines in the 'Data X' field. Please, use NA for missing data - blank lines are simply deleted and are NOT treated as missing values.

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 4 seconds \tabularnewline
R Server & 'George Udny Yule' @ 72.249.76.132 \tabularnewline
R Framework error message & Warning: there are blank lines in the 'Data X' field.
Please, use NA for missing data - blank lines are simply
 deleted and are NOT treated as missing values. \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=76459&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]4 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'George Udny Yule' @ 72.249.76.132[/C][/ROW]
[ROW][C]R Framework error message[/C][C]Warning: there are blank lines in the 'Data X' field.
Please, use NA for missing data - blank lines are simply
 deleted and are NOT treated as missing values.[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=76459&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=76459&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	4 seconds
R Server	'George Udny Yule' @ 72.249.76.132
R Framework error message	Warning: there are blank lines in the 'Data X' field. Please, use NA for missing data - blank lines are simply deleted and are NOT treated as missing values.

Model Performance
#	Complexity	split	relative error	CV error	CV S.D.
1	0.299	0	1	1.007	0.217
2	0.01	1	0.701	1.245	0.246

\begin{tabular}{lllllllll}
\hline
Model Performance \tabularnewline
# & Complexity & split & relative error & CV error & CV S.D. \tabularnewline
1 & 0.299 & 0 & 1 & 1.007 & 0.217 \tabularnewline
2 & 0.01 & 1 & 0.701 & 1.245 & 0.246 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=76459&T=1

[TABLE]
[ROW][C]Model Performance[/C][/ROW]
[ROW][C]#[/C][C]Complexity[/C][C]split[/C][C]relative error[/C][C]CV error[/C][C]CV S.D.[/C][/ROW]
[ROW][C]1[/C][C]0.299[/C][C]0[/C][C]1[/C][C]1.007[/C][C]0.217[/C][/ROW]
[ROW][C]2[/C][C]0.01[/C][C]1[/C][C]0.701[/C][C]1.245[/C][C]0.246[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=76459&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=76459&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Model Performance
#	Complexity	split	relative error	CV error	CV S.D.
1	0.299	0	1	1.007	0.217
2	0.01	1	0.701	1.245	0.246

Figure 1

PNG link

Postscript link

PDF link

Figure 2

PNG link

Postscript link

PDF link

Figure 3

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 1 ; par2 = No ;

Parameters (R input):

par1 = 1 ; par2 = No ;

R code (references can be found in the software module):

library(rpart)
library(partykit)
par1 <- as.numeric(par1)
autoprune <- function ( tree, method='Minimum CV'){
xerr <- tree$cptable[,'xerror']
cpmin.id <- which.min(xerr)
if (method == 'Minimum CV Error plus 1 SD'){
xstd <- tree$cptable[,'xstd']
errt <- xerr[cpmin.id] + xstd[cpmin.id]
cpSE1.min <- which.min( errt < xerr )
mycp <- (tree$cptable[,'CP'])[cpSE1.min]
}
if (method == 'Minimum CV') {
mycp <- (tree$cptable[,'CP'])[cpmin.id]
}
return (mycp)
}
conf.multi.mat <- function(true, new)
{
if ( all( is.na(match( levels(true),levels(new) ) )) )
stop ( 'conflict of vector levels')
multi.t <- list()
for (mylev in levels(true) ) {
true.tmp <- true
new.tmp <- new
left.lev <- levels (true.tmp)[- match(mylev,levels(true) ) ]
levels(true.tmp) <- list ( mylev = mylev, all = left.lev )
levels(new.tmp)  <- list ( mylev = mylev, all = left.lev )
curr.t <- conf.mat ( true.tmp , new.tmp )
multi.t[[mylev]] <- curr.t
multi.t[[mylev]]$precision <-
round( curr.t$conf[1,1] / sum( curr.t$conf[1,] ), 2 )
}
return (multi.t)
}
x <- t(y)
k <- length(x[1,])
n <- length(x[,1])
x1 <- cbind(x[,par1], x[,1:k!=par1])
mycolnames <- c(colnames(x)[par1], colnames(x)[1:k!=par1])
colnames(x1) <- mycolnames #colnames(x)[par1]
m <- rpart(as.data.frame(x1))
par2
if (par2 != 'No') {
mincp <- autoprune(m,method=par2)
print(mincp)
m <- prune(m,cp=mincp)
}
m$cptable
bitmap(file='test1.png')
plot(as.party(m),tp_args=list(id=FALSE))
dev.off()
bitmap(file='test2.png')
plotcp(m)
dev.off()
cbind(y=m$y,pred=predict(m),res=residuals(m))
myr <- residuals(m)
myp <- predict(m)
bitmap(file='test4.png')
op <- par(mfrow=c(2,2))
plot(myr,ylab='residuals')
plot(density(myr),main='Residual Kernel Density')
plot(myp,myr,xlab='predicted',ylab='residuals',main='Predicted vs Residuals')
plot(density(myp),main='Prediction Kernel Density')
par(op)
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Model Performance',6,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'#',header=TRUE)
a<-table.element(a,'Complexity',header=TRUE)
a<-table.element(a,'split',header=TRUE)
a<-table.element(a,'relative error',header=TRUE)
a<-table.element(a,'CV error',header=TRUE)
a<-table.element(a,'CV S.D.',header=TRUE)
a<-table.row.end(a)
for (i in 1:length(m$cptable[,1])) {
a<-table.row.start(a)
a<-table.element(a,i,header=TRUE)
a<-table.element(a,round(m$cptable[i,'CP'],3))
a<-table.element(a,m$cptable[i,'nsplit'])
a<-table.element(a,round(m$cptable[i,'rel error'],3))
a<-table.element(a,round(m$cptable[i,'xerror'],3))
a<-table.element(a,round(m$cptable[i,'xstd'],3))
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code