Merging geographical special licence data
I've merged two geographical special licence datasets (SN7630 and SN7248) with the main dataset (SN6614) for each of waves 1-9 using the variable hidp, but there appear to be more observations (hidps) in the special licence datasets than in the main dataset. Across waves 1-9, approximately 77% of observations are matched, 23% are in the special licence data but not the main dataset, and <1% are in the main dataset but not the special licence data.
I presume there's either an error in my code or a valid reason for this? Any help would be appreciated.
#1 Updated by Alita Nandi about 2 months ago
- Private changed from Yes to No
- % Done changed from 0 to 50
- Assignee set to James Risk
- Status changed from New to Feedback
The geographical special licence data files include every household with a valid postcode, including households that did not respond.
So, if you merge these geography files with w_hhsamp (households issued, including non-responding households), every household will match except those in w_hhsamp without a valid postcode. But if you merge these geography files with w_hhresp, w_indall or w_indresp, which include responding households only, then some households in the geography files will not appear in those files.
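To illustrate the point, here is a minimal sketch in plain Python using made-up hidp values (not real data). The helper merge_report and all the lists are hypothetical, and the toy assumes that household 105 responded but has no valid postcode, which is one possible reason for a small "main only" group.

```python
def merge_report(geo_hidps, main_hidps):
    """Classify household ids the way a full (outer) merge on hidp would."""
    geo, main = set(geo_hidps), set(main_hidps)
    return {
        "matched": sorted(geo & main),          # in both files
        "geography_only": sorted(geo - main),   # e.g. non-responding households
        "main_only": sorted(main - geo),        # e.g. no valid postcode
    }

# Hypothetical toy ids, for illustration only.
hhsamp = [101, 102, 103, 104, 105]   # all issued households
hhresp = [101, 102, 105]             # responding households only
geography = [101, 102, 103, 104]     # households with a valid postcode

vs_hhresp = merge_report(geography, hhresp)
vs_hhsamp = merge_report(geography, hhsamp)

print(vs_hhresp)
print(vs_hhsamp)
```

In this toy, merging against the responding-households list leaves 103 and 104 unmatched on the geography side, while merging against the issued-households list leaves nothing unmatched on that side.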
Does this answer your question?