I am trying to plot a histogram of the values 1-6, each of which corresponds to a string in a data set. I keep getting zeros in my data, as well as NAs, even though I thought I had eliminated them.
LicenseType=TexasData$License.Type
LicenseType=LicenseType[CountyInside]
LicenseType = gsub("[^[:alpha:]]", "NA", LicenseType)
LicenseTypeNumber=integer(length = length(LicenseType))
na.omit(TexasData$License.Type)
unique(LicenseType)
for (i in 1:length(LicenseType)) {
  if(is.na(LicenseType))
    LicenseType=FALSE
  else{
    if(LicenseType[i]=="SALE")
      LicenseTypeNumber[i]=1
    else if(LicenseType[i]=="BRK")
      LicenseTypeNumber[i]=2
    else if(LicenseType[i]=="BLLC")
      LicenseTypeNumber[i]=3
    else if(LicenseType[i]=="BCRP")
      LicenseTypeNumber[i]=4
    else if(LicenseType[i]=="6")
      LicenseTypeNumber[i]=6
    else if(LicenseType[i]=="REB")
      LicenseTypeNumber[i]=5
  }
}
LicenseTypeNumber
dput(LicenseTypeNumber)
hist(LicenseTypeNumber, main="Licence Type by Agents Histogram", ylab = "Agents", xlab = "License Type")
I thought of using gsub, but I am unsure whether I am using it wrong or whether it won't work on this data set. Any opinions? When I first run dput I get the values I am expecting, but it quickly starts adding these zeros. Sample of the dput output (truncated; every value shown is zero): 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ...
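For comparison, here is a minimal sketch of the same mapping done with a lookup vector instead of the loop. It is not a confirmed fix for this data set. The sample vector is made up and stands in for `TexasData$License.Type[CountyInside]`, and the name `codes` is hypothetical. The sketch rests on three things that may be relevant: `integer(length = n)` starts out as all zeros, so any entry that matches no branch stays 0; `gsub(..., "NA", ...)` inserts the literal text "NA" rather than a missing value; and `na.omit()` returns a new object without changing its argument.

```r
# Hypothetical sample standing in for TexasData$License.Type[CountyInside]
LicenseType <- c("SALE", "BRK", NA, "BLLC", " BCRP", "REB", "6", "", "SALE")

# Trim stray whitespace instead of replacing characters with the text "NA"
LicenseType <- trimws(LicenseType)

# Named lookup vector: license code -> number (same mapping as the loop)
codes <- c(SALE = 1L, BRK = 2L, BLLC = 3L, BCRP = 4L, REB = 5L, "6" = 6L)

# Vectorized lookup; anything not in the table (including NA and "") becomes NA
LicenseTypeNumber <- unname(codes[match(LicenseType, names(codes))])

# Assign the filtered result back so the NAs are actually dropped
LicenseTypeNumber <- LicenseTypeNumber[!is.na(LicenseTypeNumber)]

# One bar per license type: breaks sit halfway between the integer codes
hist(LicenseTypeNumber, breaks = seq(0.5, 6.5, by = 1),
     main = "License Type by Agents Histogram",
     ylab = "Agents", xlab = "License Type")
```

With this approach unmatched entries show up as NA rather than 0, so they can be inspected (for example with `unique()`) before being dropped.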