vendredi 21 mai 2021

Additional zeros in vector

I am trying to fit a histogram of values 1-6 associated with strings in a data set. I keep getting values equaling zero in my data as well as NA's even though I thought I eliminated them.

LicenseType=TexasData$License.Type
LicenseType=LicenseType[CountyInside]
LicenseType = gsub("[^[:alpha:]]", "NA", LicenseType)
LicenseTypeNumber=integer(length = length(LicenseType))


na.omit(TexasData$License.Type)
unique(LicenseType)

for (i in 1:length(LicenseType)) {
  if(is.na(LicenseType))
    LicenseType=FALSE
  else{
    if(LicenseType[i]=="SALE")
      LicenseTypeNumber[i]=1
    else if(LicenseType[i]=="BRK")
      LicenseTypeNumber[i]=2
    else if(LicenseType[i]=="BLLC")
      LicenseTypeNumber[i]=3
    else if(LicenseType[i]=="BCRP")
      LicenseTypeNumber[i]=4 
    else if(LicenseType[i]=="6")
      LicenseTypeNumber[i]=6
    else if(LicenseType[i]=="REB")
      LicenseTypeNumber[i]=5
  }
  
}
LicenseTypeNumber
dput(LicenseTypeNumber)

hist(LicenseTypeNumber, main="Licence Type by Agents Histogram", ylab = "Agents", xlab = "License Type")

I thought of using gsub but I am unsure if I am using it wrong or if that wont work on this data set. any opinions? When I first run dput I get values I am expecting, however it quickly adds these zeros. Sample of dput: 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,

Aucun commentaire:

Enregistrer un commentaire