Hi Dakai, can you install the R code somewhere on cedric (or whereever S+ is kept). Sanjay, you may be interested in Jaya's comments. ----- Forwarded by Chris Amos/MDACC on 07/28/2004 02:53 PM ----- "Satagopan, Jaya M./Epidemiology-Biostatistics" 07/28/2004 11:15 AM To: camos@mdanderson.org cc: "Satagopan, Jaya M./Epidemiology-Biostatistics" Subject: RE: two stage designs Hi Chris, Nice to hear from you. I am attaching the R code for two-stage design as a text file. This pertains to the Genetic Epidemiology publication of last year. Instructions on how to use this code are given in this file. I am assuming this is the one you are mentioning in your email. Let me know if this is the paper you had in mind. Also, please let me know if you have any questions about the code. Thanks. I have not yet applied this method to a data set. Here are my some of my thoughts on items 2 and 3. Please let me know what you think about this. Suppose we have accrued n1 cases and n0 controls. We utilize the same number of n = n1+n0 subjects in one-stage as well as two-stage designs. The only difference is that we genotype fewer loci under a two-stage scenario, thus saving on the genotyping cost. Suppose the cost of genotyping a single locus is the same for a case as well as a control. Say, R1 is the relative cost of recruiting a case to the cost of genotyping a single polymorphism in a case, and likewise R0 represents the relative cost in the controls. Then, the cost of the one-stage design would be T1 = n1*R1 + n0*R0 + (n1+n0)*m, where m is the total number of loci under consideration. The corresponding cost of a two-stage design would be T2 = n1*R1 + n0*R0 + (n11+n01)*m + (n1-n11 + n0-n01)*m1. Here n11 < n1 and n01