Home » date » 2010 » Dec » 12 »

WS10 RP

*The author of this computation has been verified*
R Software Module: /rwasp_regression_trees1.wasp (opens new window with default values)
Title produced by software: Recursive Partitioning (Regression Trees)
Date of computation: Sun, 12 Dec 2010 20:07:21 +0000
 
Cite this page as follows:
Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u.htm/, Retrieved Sun, 12 Dec 2010 21:05:40 +0100
 
BibTeX entries for LaTeX users:
@Manual{KEY,
    author = {{YOUR NAME}},
    publisher = {Office for Research Development and Education},
    title = {Statistical Computations at FreeStatistics.org, URL http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u.htm/},
    year = {2010},
}
@Manual{R,
    title = {R: A Language and Environment for Statistical Computing},
    author = {{R Development Core Team}},
    organization = {R Foundation for Statistical Computing},
    address = {Vienna, Austria},
    year = {2010},
    note = {{ISBN} 3-900051-07-0},
    url = {http://www.R-project.org},
}
 
Original text written by user:
 
IsPrivate?
No (this computation is public)
 
User-defined keywords:
 
Dataseries X:
» Textbox « » Textfile « » CSV «
6 73 62 66 4 58 54 54 5 68 41 82 4 62 49 61 4 65 49 65 6 81 72 77 6 73 78 66 4 64 58 66 4 68 58 66 6 51 23 48 4 68 39 57 6 61 63 80 5 69 46 60 4 73 58 70 6 61 39 85 3 62 44 59 5 63 49 72 6 69 57 70 4 47 76 74 6 66 63 70 2 58 18 51 7 63 40 70 5 69 59 71 2 59 62 72 4 59 70 50 4 63 65 69 6 65 56 73 6 65 45 66 5 71 57 73 6 60 50 58 6 81 40 78 4 67 58 83 6 66 49 76 6 62 49 77 6 63 27 79 2 73 51 71 4 55 75 79 5 59 65 60 3 64 47 73 7 63 49 70 5 64 65 42 3 73 61 74 8 54 46 68 8 76 69 83 5 74 55 62 6 63 78 79 3 73 58 61 5 67 34 86 4 68 67 64 5 66 45 75 5 62 68 59 6 71 49 82 5 63 19 61 6 75 72 69 6 77 59 60 4 62 46 59 8 74 56 81 6 67 45 65 4 56 53 60 6 60 67 60 5 58 73 45 5 65 46 75 6 49 70 84 6 61 38 77 6 66 54 64 6 64 46 54 6 65 46 72 6 46 45 56 7 65 47 67 4 81 25 81 4 72 63 73 3 65 46 67 6 74 69 72 5 59 43 69 5 69 49 71 3 58 39 77 5 71 65 63 4 79 54 49 3 68 50 74 7 66 42 76 4 62 45 65 4 69 50 65 5 63 55 69 6 62 38 71 2 61 40 68 2 65 etc...
 
Output produced by software:

Enter (or paste) a matrix (table) containing all data (time) series. Every column represents a different variable and must be delimited by a space or Tab. Every row represents a period in time (or category) and must be delimited by hard returns. The easiest way to enter data is to copy and paste a block of spreadsheet cells. Please, do not use commas or spaces to seperate groups of digits!


Summary of computational transaction
Raw Inputview raw input (R code)
Raw Outputview raw output of R engine
Computing time5 seconds
R Server'Sir Ronald Aylmer Fisher' @ 193.190.124.24


Goodness of Fit
CorrelationNA
R-squaredNA
RMSE1.3897


Actuals, Predictions, and Residuals
#ActualsForecastsResiduals
165.143835616438360.856164383561643
245.14383561643836-1.14383561643836
355.14383561643836-0.143835616438357
445.14383561643836-1.14383561643836
545.14383561643836-1.14383561643836
665.143835616438360.856164383561643
765.143835616438360.856164383561643
845.14383561643836-1.14383561643836
945.14383561643836-1.14383561643836
1065.143835616438360.856164383561643
1145.14383561643836-1.14383561643836
1265.143835616438360.856164383561643
1355.14383561643836-0.143835616438357
1445.14383561643836-1.14383561643836
1565.143835616438360.856164383561643
1635.14383561643836-2.14383561643836
1755.14383561643836-0.143835616438357
1865.143835616438360.856164383561643
1945.14383561643836-1.14383561643836
2065.143835616438360.856164383561643
2125.14383561643836-3.14383561643836
2275.143835616438361.85616438356164
2355.14383561643836-0.143835616438357
2425.14383561643836-3.14383561643836
2545.14383561643836-1.14383561643836
2645.14383561643836-1.14383561643836
2765.143835616438360.856164383561643
2865.143835616438360.856164383561643
2955.14383561643836-0.143835616438357
3065.143835616438360.856164383561643
3165.143835616438360.856164383561643
3245.14383561643836-1.14383561643836
3365.143835616438360.856164383561643
3465.143835616438360.856164383561643
3565.143835616438360.856164383561643
3625.14383561643836-3.14383561643836
3745.14383561643836-1.14383561643836
3855.14383561643836-0.143835616438357
3935.14383561643836-2.14383561643836
4075.143835616438361.85616438356164
4155.14383561643836-0.143835616438357
4235.14383561643836-2.14383561643836
4385.143835616438362.85616438356164
4485.143835616438362.85616438356164
4555.14383561643836-0.143835616438357
4665.143835616438360.856164383561643
4735.14383561643836-2.14383561643836
4855.14383561643836-0.143835616438357
4945.14383561643836-1.14383561643836
5055.14383561643836-0.143835616438357
5155.14383561643836-0.143835616438357
5265.143835616438360.856164383561643
5355.14383561643836-0.143835616438357
5465.143835616438360.856164383561643
5565.143835616438360.856164383561643
5645.14383561643836-1.14383561643836
5785.143835616438362.85616438356164
5865.143835616438360.856164383561643
5945.14383561643836-1.14383561643836
6065.143835616438360.856164383561643
6155.14383561643836-0.143835616438357
6255.14383561643836-0.143835616438357
6365.143835616438360.856164383561643
6465.143835616438360.856164383561643
6565.143835616438360.856164383561643
6665.143835616438360.856164383561643
6765.143835616438360.856164383561643
6865.143835616438360.856164383561643
6975.143835616438361.85616438356164
7045.14383561643836-1.14383561643836
7145.14383561643836-1.14383561643836
7235.14383561643836-2.14383561643836
7365.143835616438360.856164383561643
7455.14383561643836-0.143835616438357
7555.14383561643836-0.143835616438357
7635.14383561643836-2.14383561643836
7755.14383561643836-0.143835616438357
7845.14383561643836-1.14383561643836
7935.14383561643836-2.14383561643836
8075.143835616438361.85616438356164
8145.14383561643836-1.14383561643836
8245.14383561643836-1.14383561643836
8355.14383561643836-0.143835616438357
8465.143835616438360.856164383561643
8525.14383561643836-3.14383561643836
8625.14383561643836-3.14383561643836
8765.143835616438360.856164383561643
8845.14383561643836-1.14383561643836
8955.14383561643836-0.143835616438357
9065.143835616438360.856164383561643
9175.143835616438361.85616438356164
9285.143835616438362.85616438356164
9365.143835616438360.856164383561643
9465.143835616438360.856164383561643
9535.14383561643836-2.14383561643836
9675.143835616438361.85616438356164
9735.14383561643836-2.14383561643836
9865.143835616438360.856164383561643
9945.14383561643836-1.14383561643836
10045.14383561643836-1.14383561643836
10165.143835616438360.856164383561643
10265.143835616438360.856164383561643
10365.143835616438360.856164383561643
10445.14383561643836-1.14383561643836
10575.143835616438361.85616438356164
10655.14383561643836-0.143835616438357
10775.143835616438361.85616438356164
10845.14383561643836-1.14383561643836
10965.143835616438360.856164383561643
11065.143835616438360.856164383561643
11165.143835616438360.856164383561643
11255.14383561643836-0.143835616438357
11355.14383561643836-0.143835616438357
11465.143835616438360.856164383561643
11575.143835616438361.85616438356164
11645.14383561643836-1.14383561643836
11745.14383561643836-1.14383561643836
11885.143835616438362.85616438356164
11965.143835616438360.856164383561643
12035.14383561643836-2.14383561643836
12145.14383561643836-1.14383561643836
12255.14383561643836-0.143835616438357
12355.14383561643836-0.143835616438357
12465.143835616438360.856164383561643
12585.143835616438362.85616438356164
12625.14383561643836-3.14383561643836
12745.14383561643836-1.14383561643836
12875.143835616438361.85616438356164
12955.14383561643836-0.143835616438357
13065.143835616438360.856164383561643
13165.143835616438360.856164383561643
13245.14383561643836-1.14383561643836
13355.14383561643836-0.143835616438357
13465.143835616438360.856164383561643
13565.143835616438360.856164383561643
13665.143835616438360.856164383561643
13765.143835616438360.856164383561643
13855.14383561643836-0.143835616438357
13955.14383561643836-0.143835616438357
14065.143835616438360.856164383561643
14145.14383561643836-1.14383561643836
14265.143835616438360.856164383561643
14335.14383561643836-2.14383561643836
14465.143835616438360.856164383561643
14585.143835616438362.85616438356164
14645.14383561643836-1.14383561643836
 
Charts produced by software:
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u/2aqkd1292184433.png (open in new window)
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u/2aqkd1292184433.ps (open in new window)


http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u/3aqkd1292184433.png (open in new window)
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u/3aqkd1292184433.ps (open in new window)


http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u/4lzjg1292184433.png (open in new window)
http://www.freestatistics.org/blog/date/2010/Dec/12/t1292184340ykgkveojywlmh0u/4lzjg1292184433.ps (open in new window)


 
Parameters (Session):
par1 = 1 ; par2 = none ; par3 = 3 ; par4 = no ;
 
Parameters (R input):
par1 = 1 ; par2 = none ; par3 = 3 ; par4 = no ;
 
R code (references can be found in the software module):
library(party)
library(Hmisc)
par1 <- as.numeric(par1)
par3 <- as.numeric(par3)
x <- data.frame(t(y))
is.data.frame(x)
x <- x[!is.na(x[,par1]),]
k <- length(x[1,])
n <- length(x[,1])
colnames(x)[par1]
x[,par1]
if (par2 == 'kmeans') {
cl <- kmeans(x[,par1], par3)
print(cl)
clm <- matrix(cbind(cl$centers,1:par3),ncol=2)
clm <- clm[sort.list(clm[,1]),]
for (i in 1:par3) {
cl$cluster[cl$cluster==clm[i,2]] <- paste('C',i,sep='')
}
cl$cluster <- as.factor(cl$cluster)
print(cl$cluster)
x[,par1] <- cl$cluster
}
if (par2 == 'quantiles') {
x[,par1] <- cut2(x[,par1],g=par3)
}
if (par2 == 'hclust') {
hc <- hclust(dist(x[,par1])^2, 'cen')
print(hc)
memb <- cutree(hc, k = par3)
dum <- c(mean(x[memb==1,par1]))
for (i in 2:par3) {
dum <- c(dum, mean(x[memb==i,par1]))
}
hcm <- matrix(cbind(dum,1:par3),ncol=2)
hcm <- hcm[sort.list(hcm[,1]),]
for (i in 1:par3) {
memb[memb==hcm[i,2]] <- paste('C',i,sep='')
}
memb <- as.factor(memb)
print(memb)
x[,par1] <- memb
}
if (par2=='equal') {
ed <- cut(as.numeric(x[,par1]),par3,labels=paste('C',1:par3,sep=''))
x[,par1] <- as.factor(ed)
}
table(x[,par1])
colnames(x)
colnames(x)[par1]
x[,par1]
if (par2 == 'none') {
m <- ctree(as.formula(paste(colnames(x)[par1],' ~ .',sep='')),data = x)
}
load(file='createtable')
if (par2 != 'none') {
m <- ctree(as.formula(paste('as.factor(',colnames(x)[par1],') ~ .',sep='')),data = x)
if (par4=='yes') {
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'10-Fold Cross Validation',3+2*par3,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'',1,TRUE)
a<-table.element(a,'Prediction (training)',par3+1,TRUE)
a<-table.element(a,'Prediction (testing)',par3+1,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Actual',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,paste('C',jjj,sep=''),1,TRUE)
a<-table.element(a,'CV',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,paste('C',jjj,sep=''),1,TRUE)
a<-table.element(a,'CV',1,TRUE)
a<-table.row.end(a)
for (i in 1:10) {
ind <- sample(2, nrow(x), replace=T, prob=c(0.9,0.1))
m.ct <- ctree(as.formula(paste('as.factor(',colnames(x)[par1],') ~ .',sep='')),data =x[ind==1,])
if (i==1) {
m.ct.i.pred <- predict(m.ct, newdata=x[ind==1,])
m.ct.i.actu <- x[ind==1,par1]
m.ct.x.pred <- predict(m.ct, newdata=x[ind==2,])
m.ct.x.actu <- x[ind==2,par1]
} else {
m.ct.i.pred <- c(m.ct.i.pred,predict(m.ct, newdata=x[ind==1,]))
m.ct.i.actu <- c(m.ct.i.actu,x[ind==1,par1])
m.ct.x.pred <- c(m.ct.x.pred,predict(m.ct, newdata=x[ind==2,]))
m.ct.x.actu <- c(m.ct.x.actu,x[ind==2,par1])
}
}
print(m.ct.i.tab <- table(m.ct.i.actu,m.ct.i.pred))
numer <- 0
for (i in 1:par3) {
print(m.ct.i.tab[i,i] / sum(m.ct.i.tab[i,]))
numer <- numer + m.ct.i.tab[i,i]
}
print(m.ct.i.cp <- numer / sum(m.ct.i.tab))
print(m.ct.x.tab <- table(m.ct.x.actu,m.ct.x.pred))
numer <- 0
for (i in 1:par3) {
print(m.ct.x.tab[i,i] / sum(m.ct.x.tab[i,]))
numer <- numer + m.ct.x.tab[i,i]
}
print(m.ct.x.cp <- numer / sum(m.ct.x.tab))
for (i in 1:par3) {
a<-table.row.start(a)
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
for (jjj in 1:par3) a<-table.element(a,m.ct.i.tab[i,jjj])
a<-table.element(a,round(m.ct.i.tab[i,i]/sum(m.ct.i.tab[i,]),4))
for (jjj in 1:par3) a<-table.element(a,m.ct.x.tab[i,jjj])
a<-table.element(a,round(m.ct.x.tab[i,i]/sum(m.ct.x.tab[i,]),4))
a<-table.row.end(a)
}
a<-table.row.start(a)
a<-table.element(a,'Overall',1,TRUE)
for (jjj in 1:par3) a<-table.element(a,'-')
a<-table.element(a,round(m.ct.i.cp,4))
for (jjj in 1:par3) a<-table.element(a,'-')
a<-table.element(a,round(m.ct.x.cp,4))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable3.tab')
}
}
m
bitmap(file='test1.png')
plot(m)
dev.off()
bitmap(file='test1a.png')
plot(x[,par1] ~ as.factor(where(m)),main='Response by Terminal Node',xlab='Terminal Node',ylab='Response')
dev.off()
if (par2 == 'none') {
forec <- predict(m)
result <- as.data.frame(cbind(x[,par1],forec,x[,par1]-forec))
colnames(result) <- c('Actuals','Forecasts','Residuals')
print(result)
}
if (par2 != 'none') {
print(cbind(as.factor(x[,par1]),predict(m)))
myt <- table(as.factor(x[,par1]),predict(m))
print(myt)
}
bitmap(file='test2.png')
if(par2=='none') {
op <- par(mfrow=c(2,2))
plot(density(result$Actuals),main='Kernel Density Plot of Actuals')
plot(density(result$Residuals),main='Kernel Density Plot of Residuals')
plot(result$Forecasts,result$Actuals,main='Actuals versus Predictions',xlab='Predictions',ylab='Actuals')
plot(density(result$Forecasts),main='Kernel Density Plot of Predictions')
par(op)
}
if(par2!='none') {
plot(myt,main='Confusion Matrix',xlab='Actual',ylab='Predicted')
}
dev.off()
if (par2 == 'none') {
detcoef <- cor(result$Forecasts,result$Actuals)
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Goodness of Fit',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Correlation',1,TRUE)
a<-table.element(a,round(detcoef,4))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'R-squared',1,TRUE)
a<-table.element(a,round(detcoef*detcoef,4))
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'RMSE',1,TRUE)
a<-table.element(a,round(sqrt(mean((result$Residuals)^2)),4))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable1.tab')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Actuals, Predictions, and Residuals',4,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'#',header=TRUE)
a<-table.element(a,'Actuals',header=TRUE)
a<-table.element(a,'Forecasts',header=TRUE)
a<-table.element(a,'Residuals',header=TRUE)
a<-table.row.end(a)
for (i in 1:length(result$Actuals)) {
a<-table.row.start(a)
a<-table.element(a,i,header=TRUE)
a<-table.element(a,result$Actuals[i])
a<-table.element(a,result$Forecasts[i])
a<-table.element(a,result$Residuals[i])
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable.tab')
}
if (par2 != 'none') {
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Confusion Matrix (predicted in columns / actuals in rows)',par3+1,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'',1,TRUE)
for (i in 1:par3) {
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
}
a<-table.row.end(a)
for (i in 1:par3) {
a<-table.row.start(a)
a<-table.element(a,paste('C',i,sep=''),1,TRUE)
for (j in 1:par3) {
a<-table.element(a,myt[i,j])
}
a<-table.row.end(a)
}
a<-table.end(a)
table.save(a,file='mytable2.tab')
}
 





Copyright

Creative Commons License

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 License.

Software written by Ed van Stee & Patrick Wessa


Disclaimer

Information provided on this web site is provided "AS IS" without warranty of any kind, either express or implied, including, without limitation, warranties of merchantability, fitness for a particular purpose, and noninfringement. We use reasonable efforts to include accurate and timely information and periodically update the information, and software without notice. However, we make no warranties or representations as to the accuracy or completeness of such information (or software), and we assume no liability or responsibility for errors or omissions in the content of this web site, or any software bugs in online applications. Your use of this web site is AT YOUR OWN RISK. Under no circumstances and under no legal theory shall we be liable to you or any other person for any direct, indirect, special, incidental, exemplary, or consequential damages arising from your access to, or use of, this web site.


Privacy Policy

We may request personal information to be submitted to our servers in order to be able to:

  • personalize online software applications according to your needs
  • enforce strict security rules with respect to the data that you upload (e.g. statistical data)
  • manage user sessions of online applications
  • alert you about important changes or upgrades in resources or applications

We NEVER allow other companies to directly offer registered users information about their products and services. Banner references and hyperlinks of third parties NEVER contain any personal data of the visitor.

We do NOT sell, nor transmit by any means, personal information, nor statistical data series uploaded by you to third parties.

We carefully protect your data from loss, misuse, alteration, and destruction. However, at any time, and under any circumstance you are solely responsible for managing your passwords, and keeping them secret.

We store a unique ANONYMOUS USER ID in the form of a small 'Cookie' on your computer. This allows us to track your progress when using this website which is necessary to create state-dependent features. The cookie is used for NO OTHER PURPOSE. At any time you may opt to disallow cookies from this website - this will not affect other features of this website.

We examine cookies that are used by third-parties (banner and online ads) very closely: abuse from third-parties automatically results in termination of the advertising contract without refund. We have very good reason to believe that the cookies that are produced by third parties (banner ads) do NOT cause any privacy or security risk.

FreeStatistics.org is safe. There is no need to download any software to use the applications and services contained in this website. Hence, your system's security is not compromised by their use, and your personal data - other than data you submit in the account application form, and the user-agent information that is transmitted by your browser - is never transmitted to our servers.

As a general rule, we do not log on-line behavior of individuals (other than normal logging of webserver 'hits'). However, in cases of abuse, hacking, unauthorized access, Denial of Service attacks, illegal copying, hotlinking, non-compliance with international webstandards (such as robots.txt), or any other harmful behavior, our system engineers are empowered to log, track, identify, publish, and ban misbehaving individuals - even if this leads to ban entire blocks of IP addresses, or disclosing user's identity.


FreeStatistics.org is powered by