Repository of Reproducible Computations

Free Statistics

of Irreproducible Research!

Author's title

Author

*Unverified author*

R Software Module

rwasp_bidensity.wasp

Title produced by software

Bivariate Kernel Density Estimation

Date of computation

Wed, 12 Nov 2008 11:30:35 -0700

Cite this page as follows

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?v=date/2008/Nov/12/t1226514673rzl2ach1t0ss2k5.htm/, Retrieved Mon, 30 Jun 2025 22:48:56 +0000

Statistical Computations at FreeStatistics.org, Office for Research Development and Education, URL https://freestatistics.org/blog/index.php?pk=24354, Retrieved Mon, 30 Jun 2025 22:48:56 +0000

QR Codes:

Paste this QR Code to cite your computation.

Original text written by user:

IsPrivate?

No (this computation is public)

User-defined keywords

Severijns Britt

Estimated Impact

245

Family? (F = Feedback message, R = changed R code, M = changed R Module, P = changed Parameters, D = changed Data)

-     [Bivariate Kernel Density Estimation] [Variuos EDA topic...] [2008-11-09 13:17:45] [3548296885df7a66ea8efc200c4aca50]
F    D    [Bivariate Kernel Density Estimation] [various EDA topic...] [2008-11-12 18:30:35] [78308c9f3efc33d1da821bcd963df161] [Current]

Feedback Forum

2008-11-23 15:37:47 [Aurélie Van Impe] [reply] 
Je hebt de grafieken wel gemaakt, maar je bent vergeten er commentaar bij te geven. Je had bij deze grafiek kunnen schrijven dat het gaat om een weergave van 2 dimensies. En dat de hoogtelijnen dienen om de concentratie weer te geven. De roze gebieden geven de kern weer, in kern is de waarschijnlijkheid het grootst dat hier gegevens liggen. Je ziet op de tekening meerdere kernen, en dus meerdere groepen van gegevens. Aangezien de kernen een ellipsvorm hebben, van links onder naar rechts boven, kunnen we zeggen dat er een positieve correlatie is tussen de 2 reeksen gegevens. Links van de rechte lijn, die het gemiddelde van alle gegevens voorstelt, zie je nog een cluster van gegevens. Deze gegevens wijken een beetje af van de andere gegevens, maar ook zij vormen een ellips, dus er is duidelijk een positief verband tussen deze twee variabelen. Wat ook logisch is, want beide variabelen gaan over het aantal werklozen. Dus het zou onlogisch zijn als er geen verband was tussen deze variabelen.
2008-11-23 16:11:22 [Alexander Hendrickx] [reply] 
De grafieken werden goed gemaakt, maar er staat geen conclusie bij. De biavariate kernel density plot is een goede manier om correlaties visueel voor te stellen. Ook de concentratie van de gegevens zijn gemakkelijk af te lezen aan de hand van de hoogtelijnen. Hoe dichter deze bij elkaar liggen hoe groter de concentratie en hoe meer kans je dus hebt dat een punt daar zal liggen. Met de partial correlatie, kan je twee variabelen met elkaar vergelijken terwijl je een derde constant houdt, wat handig is om de invloed van de verschillende variabelen op de correlatie te analyseren.
2008-11-23 17:33:21 [7bf28d4d60530086dbc44ae6b648927e] [reply] 
De bivariate Kernel density plot is een manier om het verband te zien tussen 2 variablen x en y. Er zijn hoogtlijnen te zien. Als er elipsvormige hoogtelijnen zijn dan is er een positief verband. De scatterplots zijn dan lineair. Wanneer ze cirkelvormig zijn is er geen verband. Hier is er wel een positief verband aanwezig.

Post a new message

Dataseries X:

Download CSV

Histogram

Boxplots

Dataseries Y:

Download CSV

Histogram

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	1 seconds
R Server	'Gwilym Jenkins' @ 72.249.127.135

\begin{tabular}{lllllllll}
\hline
Summary of computational transaction \tabularnewline
Raw Input & view raw input (R code)  \tabularnewline
Raw Output & view raw output of R engine  \tabularnewline
Computing time & 1 seconds \tabularnewline
R Server & 'Gwilym Jenkins' @ 72.249.127.135 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24354&T=0

[TABLE]
[ROW][C]Summary of computational transaction[/C][/ROW]
[ROW][C]Raw Input[/C][C]view raw input (R code) [/C][/ROW]
[ROW][C]Raw Output[/C][C]view raw output of R engine [/C][/ROW]
[ROW][C]Computing time[/C][C]1 seconds[/C][/ROW]
[ROW][C]R Server[/C][C]'Gwilym Jenkins' @ 72.249.127.135[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24354&T=0

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24354&T=0

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Summary of computational transaction
Raw Input	view raw input (R code)
Raw Output	view raw output of R engine
Computing time	1 seconds
R Server	'Gwilym Jenkins' @ 72.249.127.135

Bandwidth
x axis	8051.24435041897
y axis	22572.23889631
Correlation
correlation used in KDE	0.792846709181054
correlation(x,y)	0.792846709181054

\begin{tabular}{lllllllll}
\hline
Bandwidth \tabularnewline
x axis & 8051.24435041897 \tabularnewline
y axis & 22572.23889631 \tabularnewline
Correlation \tabularnewline
correlation used in KDE & 0.792846709181054 \tabularnewline
correlation(x,y) & 0.792846709181054 \tabularnewline
\hline
\end{tabular}
%Source: https://freestatistics.org/blog/index.php?pk=24354&T=1

[TABLE]
[ROW][C]Bandwidth[/C][/ROW]
[ROW][C]x axis[/C][C]8051.24435041897[/C][/ROW]
[ROW][C]y axis[/C][C]22572.23889631[/C][/ROW]
[ROW][C]Correlation[/C][/ROW]
[ROW][C]correlation used in KDE[/C][C]0.792846709181054[/C][/ROW]
[ROW][C]correlation(x,y)[/C][C]0.792846709181054[/C][/ROW]
[/TABLE]
Source: https://freestatistics.org/blog/index.php?pk=24354&T=1

Globally Unique Identifier (entire table): ba.freestatistics.org/blog/index.php?pk=24354&T=1

As an alternative you can also use a QR Code:

The GUIDs for individual cells are displayed in the table below:

Bandwidth
x axis	8051.24435041897
y axis	22572.23889631
Correlation
correlation used in KDE	0.792846709181054
correlation(x,y)	0.792846709181054

Figure 1

PNG link

Postscript link

PDF link

Parameters (Session):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

Parameters (R input):

par1 = 50 ; par2 = 50 ; par3 = 0 ; par4 = 0 ; par5 = 0 ; par6 = Y ; par7 = Y ;

R code (references can be found in the software module):

par1 <- as(par1,'numeric')
par2 <- as(par2,'numeric')
par3 <- as(par3,'numeric')
par4 <- as(par4,'numeric')
par5 <- as(par5,'numeric')
library('GenKern')
if (par3==0) par3 <- dpik(x)
if (par4==0) par4 <- dpik(y)
if (par5==0) par5 <- cor(x,y)
if (par1 > 500) par1 <- 500
if (par2 > 500) par2 <- 500
bitmap(file='bidensity.png')
op <- KernSur(x,y, xgridsize=par1, ygridsize=par2, correlation=par5, xbandwidth=par3, ybandwidth=par4)
image(op$xords, op$yords, op$zden, col=terrain.colors(100), axes=TRUE,main=main,xlab=xlab,ylab=ylab)
if (par6=='Y') contour(op$xords, op$yords, op$zden, add=TRUE)
if (par7=='Y') points(x,y)
(r<-lm(y ~ x))
abline(r)
box()
dev.off()
load(file='createtable')
a<-table.start()
a<-table.row.start(a)
a<-table.element(a,'Bandwidth',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'x axis',header=TRUE)
a<-table.element(a,par3)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'y axis',header=TRUE)
a<-table.element(a,par4)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'Correlation',2,TRUE)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation used in KDE',header=TRUE)
a<-table.element(a,par5)
a<-table.row.end(a)
a<-table.row.start(a)
a<-table.element(a,'correlation(x,y)',header=TRUE)
a<-table.element(a,cor(x,y))
a<-table.row.end(a)
a<-table.end(a)
table.save(a,file='mytable.tab')

Free Statistics

Description of Statistical Computation

Tree of Dependent Computations

Dataset

Tables (Output of Computation)

Figures (Output of Computation)

Input Parameters & R Code