ftp://ftp.ncbi.nih.gov/pub/geo/DATA/supplementary/
https://www.broadinstitute.org/medical-and-population-genetics/hapmap-3
ftp://ftp.illumina.com/
user name:guest
password:illumina
http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1000_genomes_project/release/20181203_biallelic_SNV/
Every time I think I know what's going on, suddenly there's another layer of complications.
2018年12月29日星期六
2018年12月25日星期二
Manhattan Plot for GWAS
if(!require("ggplot2")) {
print("Please install the ggplot plackage")
## install.packages('ggplot2', dep=T)
}
require(ggplot2)
d <- read.table("simulated_assoc.assoc",header=T)
d <- na.omit(d)
maxy <- 15
sig <- -log10(5e-8)
d$logp <- -log10(d$P)
d$x <- NA
ticks <- NULL
lastbase <- 0
order <- order(d$CHR,d$BP)
d <- d[order,]
d[d$logp > maxy,'logp'] <- maxy
numchrs <- length(unique(d$CHR))
for( i in unique(d$CHR) ) {
if( i==1 ) {
d[d$CHR==i,]$x <- d[d$CHR==i,]$BP
} else {
lastbase <- lastbase+tail(subset(d,CHR==i-1)$BP,1)
d[d$CHR==i,]$x <- d[d$CHR==i,]$BP+lastbase
}
ticks <- c(ticks,d[d$CHR==i,]$x[floor(length(d[d$CHR==i,]$x)/2)+1])
}
ticklim <- c(min(d$x),max(d$x))
if(numchrs > 1) {
cols <- rep(c("gray","black"),max(d$CHR))
} else {
cols <- "gray"
}
if (maxy=="max") maxy=ceiling(max(d$logp)) else maxy=maxy
if (maxy<8) maxy=8
pdf("manhattan_plot.pdf")
plot <- qplot(x,logp,data=d,
ylab=expression(-log[10](italic(p))) , colour=factor(CHR))
plot <- plot + scale_x_continuous(name="Chromosome",
breaks=ticks,
labels=(unique(d$CHR)))
plot <- plot + scale_y_continuous(limits=c(0,maxy),
breaks=1:maxy)
plot <- plot + scale_colour_manual(values=cols)
plot <- plot + theme(legend.position="none")
plot <- plot + labs(title="Manhattan Plot")
plot <- plot + theme(panel.background=element_blank(),
panel.grid.minor=element_blank())
plot <- plot + geom_abline(intercept=sig,slope=0, col="black")
2018年12月23日星期日
some commands for Ubuntu
- cd /home/user/Download
- sudo cp filename /usr/bin
If you want to copy a folder from your download then just add -r before cp command
sudo cp -r foldername /usr/bin
you can use mv command to move that file or folder
sudo mv filename /usr/bin
3. awk: line 2: missing } near end of file
use awk '{print $1,$2,$4,$5}'
3. awk: line 2: missing } near end of file
use awk '{print $1,$2,$4,$5}'
2018年12月14日星期五
2018年10月11日星期四
steps for simulation sample size
- Write the statistical model mathematically.
- Generate the sample according to the model with sample size N.
- Fit the model, perform the test, and record the rejection or acceptance of hull hypothesis.
- Repeat step 2 and 3 n (generally I used 5000) times.
- Calculate the power by (# of rejections)/n.
6 If power is too higher, decrease sample size N, repeat 2 - 5. If power is too lower, increase sample size N, repeat 2 - 5.
2018年9月25日星期二
number of basis fuction for spline regression
The number of basis function is n+d+1, where n is the number of (internal) knots, where d is the degree of the basis function.
2018年6月13日星期三
2018年6月4日星期一
2018年5月21日星期一
Producing graphs of the survival and baseline hazard function after Cox regression
- When only
plots=survival
is specified on the proc phreg
statement, SAS will produce one graph, a “reference curve” of the survival function at the reference level of all categorical predictors and at the mean of all continuous predictors.
https://stats.idre.ucla.edu/sas/seminars/sas-survival/
plots=survival
is specified on the proc phreg
statement, SAS will produce one graph, a “reference curve” of the survival function at the reference level of all categorical predictors and at the mean of all continuous predictors.2018年4月18日星期三
Heine–Borel theorem
https://ipfs.io/ipfs/QmXoypizjW3WknFiJnKLwHCnL72vedxjQkDDP1mXWo6uco/wiki/Heine%E2%80%93Borel_theorem.html
2018年4月8日星期日
2018年2月28日星期三
A SAS macro to rename all variable names in a data set
Data one;
U_2=1;
V_3=2;
X_2=3;
Y_2=4;
Z_3=5;
Run;
options macrogen mprint mlogic;
%macro rename(lib,dsn);
options pageno=1 nodate;
proc contents data=&lib..&dsn;
title "Before Renaming All Variables";
run;
proc sql noprint;
select nvar into :num_vars
from dictionary.tables
where libname="&LIB" and
memname="&DSN";
select distinct(name) into :var1-
:var%TRIM(%LEFT(&num_vars))
from dictionary.columns
where libname="&LIB" and
memname="&DSN";
quit;
run;
proc datasets library=&LIB;
modify &DSN;
rename
%do i=1 %to &num_vars;
&&var&i=%substr(&&var&i., 1, %length(&&var&i.)-2)
%end;
;
quit;
run;
options pageno=1 nodate;
proc contents data=&lib..&dsn;
title "After Renaming All Variables";
run;
%mend rename;
%rename(WORK,ONE);/***Note the library and database name should be all in CAPITAL LETTERS)****/
U_2=1;
V_3=2;
X_2=3;
Y_2=4;
Z_3=5;
Run;
options macrogen mprint mlogic;
%macro rename(lib,dsn);
options pageno=1 nodate;
proc contents data=&lib..&dsn;
title "Before Renaming All Variables";
run;
proc sql noprint;
select nvar into :num_vars
from dictionary.tables
where libname="&LIB" and
memname="&DSN";
select distinct(name) into :var1-
:var%TRIM(%LEFT(&num_vars))
from dictionary.columns
where libname="&LIB" and
memname="&DSN";
quit;
run;
proc datasets library=&LIB;
modify &DSN;
rename
%do i=1 %to &num_vars;
&&var&i=%substr(&&var&i., 1, %length(&&var&i.)-2)
%end;
;
quit;
run;
options pageno=1 nodate;
proc contents data=&lib..&dsn;
title "After Renaming All Variables";
run;
%mend rename;
%rename(WORK,ONE);/***Note the library and database name should be all in CAPITAL LETTERS)****/
2018年1月9日星期二
Some reviews for Martingales
1. Simple Random Walk
2. Sample path
2. Sample path
https://math.stackexchange.com/questions/1472068/what-is-a-sample-path-of-a-stochastic-process
订阅:
博文 (Atom)