Using a sampling weight to correct for unbalanced panel data
Hi, I am conducting regressions on panel data. My y variable is wage. As the panel is unbalanced, the individuals contributing more wage observations carry more weight in the pooled sample, and the...
View ArticleMultivariate analyses with small sample size and count data
Hello, This is my first forum post. As a quick introduction, I'm a master's student in public health working on my thesis project, and I'm very excited to have found out about the statalist forum....
View Articlemerging two data sets.
I want to merge two datasets. Both contain the same company name. But the problem is one file has multiple years. e.g.company name. Year Alpha 2005 Aplha 2006 Alpha 2007 Alpha 2008 Beta 2007 Beta 2008...
View ArticleError r(5004): Java installation not found
Hi: I am posting this message on behalf of a friend who is facing trouble reading data in Stata. She is trying to import a large .csv file in Stata by issuing the following command Code: import...
View ArticleCounting the number of non-distinct identifiers in two variables
Hi, I have two variables, Partner1 and Partner2, which contain unique passport numbers in string format. An example: Partner 1 Partner 2 E101 E100 E102 E103 E100 E101 E105 E106 E107 E108 Over here, we...
View ArticleAbout areg command for fixed effects
Hi. I have a question about areg command for fixed effects. areg y x1, absorb(x2) predict x2_fixed, d Using above command, I can get coefficients (fixed effects) of x2. If I run this command: reg y x1...
View ArticleChanging variable names with variable labels
Hi all, I have been researching and trying out some commands. May I know why is it that these two categories of commands are literally the same but the bottom one has syntax error at the renaming...
View ArticleRegression with plausible values
Hello Stata community, I am doing analysis with the PIAAC dataset. Stata has a PIAAC tools package to treat this specific dataset: ssc install piaactools This package allows me to do regressions that...
View ArticleInterpolating with percentage growth rate
Dear Statalisters, I have two time-series dataset with time-series X and Y. X is quarterly while Y is annual. I'm trying to interpolate Y from annual to quarterly data, using the growth rate of X. For...
View Articlextreg/mixed standard deviation
I'm doing some analysis on school value added where I estimate a regression of the form yit = a + xit B + vi + eit I'm specifically interested in the distribution of the unit-specific error term vi. I...
View ArticleXtgee with Unbalanced Panel in Stata
I've been reading some conflicting articles online regarding using a GEE model with unbalanced and balanced panel data, so I was wondering if anyone on Statalist can chime in. I'm currently using a GEE...
View ArticleHausman test not working
I am running a regression inluding the following variables: lnwage, age, age squared, experience, experiencesquared as well as others. When I conduct the Hausman test for fixed effects, I get a message...
View ArticleGeonear error term
Hi All, I am using geonear on some really massive datasets (several million longitude latitude pairs and comparing those to about 700 addresses). To deal with this I am looping through the file and...
View ArticleCorrect standard errors in panel data
Hello everyone, I have a sample of 236 observations in a slightly unbalanced panel. T=24, N=10. I'm estimating the effect from bank leverage on their equity betas. How do I correct the standard errors...
View ArticleRepeated time value error in Stata when declaring Panel Data
Hi, So I am working on a class project, where my testable hypothesis is that "Players playing position on the soccer field influence his overall rating" The data set I found was in sql, so I had to...
View Articlerow sorting?
Suppose that I have the following data: Code: * Example generated by -dataex-. To install: ssc install dataex clear input float(id x1 x2 x3 x4 x5 x6) 1 13 14 12 2 23 56 2 2 34 56 43 21 12 3 2 3 45 1 23...
View Articlenl command question and non linear models fitting.
Hello everyone, I have a dataset of growth (time vs frequency) and I am using the nl command to see how different non linear models fit. I have the following questions: 1) I found old documents (1992)...
View Articlespmap help
Hi, I'm trying to create a map of a binary outcome variable layered onto census track data. My dataset contains 23, 726 variables. I installed the spmap and shp2dta programs and I was able to download...
View ArticleStandardizing variables via "center" command
Hey everyone, I came across something peculiar. Apparently, while "center" should create a mean zero and standard deviation of one variable, I had an application where the standard deviation of the...
View ArticleFrequency domain granger causality test
Hi, Im not sure if this is the right place to post this... but anyway... I have a time series data set with annual data from year 1996 to 2015. But data from 1997, 1999 and 2001 is missing. I will...
View Article