Data User Guide

Data User Guide

Data User Guide – September 2019
Ten to Men logo

Copyright

Some of the material included or referred to in the Data User Guide is subject to the copyright. Copyright information will progressively be added to the study materials. For more information about copyright permissions, email us at: ttmdatamanager@aifs.gov.au.

The Australian Institute of Family Studies (AIFS) does not guarantee the accuracy, currency or completeness of any copyright information provided in the Ten to Men study materials.

  • AIFS accepts no legal liability in relation to any use of the material included or referred to in the Data User Guide.
  • Users should ensure that they seek and obtain the appropriate copyright permissions from the owner of any copyright in this material prior to use.
  • AIFS is not responsible for obtaining or assisting users with copyright permissions.

Read the publication

About this guide

About this guide
This Data User Guide is a reference tool for users of the Ten to Men: The Australian Longitudinal Study on Male Health datasets.
It is intended to provide the necessary information to be able to use the Ten to Men data. This includes information on the study design, questionnaires, data files, data linkages, user resources, statistical considerations and how to access the Ten to Men data.
Additional resources available for users of the Ten to Men data include:

  • Questionnaire
  • Data Dictionary
  • Data Books
  • Data Issues Paper.

If you have any feedback, had difficulty understanding any of the Data User Guide's content or would like us to include additional information, please do not hesitate to email us at: ttmdatamanager@aifs.gov.au

1. Introduction to Ten to Men

1. Introduction to Ten to Men
1.1 Background and objectives
The Commonwealth Department of Health commissioned the Australian Longitudinal Study on Male Health (Ten to Men) in 2011 in response to the 2010 National Male Health Policy, which recognised significant gaps in the knowledge on male health. Addressing those gaps and building the evidence base on male health was identified as a priority to reduce disparities in male health, both between males and females, and between different sub-populations of males. Reflecting that focus, the objectives of Ten to Men are:

  • to examine male health and its key determinants including social, economic, environmental and behavioural factors that affect the length and quality of life of Australian males
  • to address a range of key research questions about the physical and mental health of Australian males, including their health behaviours and risk factors, key life transition points, social and economic environments in which they work and live together and their use of health and other services
  • to identify policy opportunities for improving the health and wellbeing of males and providing support for males at key life stages, particularly those at risk of poor health.

The content of Ten to Men was designed around six key research domains relevant to male health:

  • wellbeing and mental health
  • use of health services
  • health-related behaviours
  • health status
  • health knowledge
  • social determinants.

1.2 Ten to Men study design and sampling
Ten to Men is Australia's first national longitudinal study that focuses exclusively on male health and wellbeing. A number of key decisions informed the development of the final sampling plan for Ten to Men:

  • The decision to conduct face-to-face interviews with boys aged 10-14 years: The cost of conducting face-to-face interviews ruled out a simple random sample and introduced the need for clustering.
  • The decision to oversample regional males: Oversampling regional males introduced the need for some mechanism to stratify the sample by region.
  • The decision to adopt a household recruitment methodology: Household recruitment requires defined geographic units and also required clustering.
  • The decision that a population sample would be drawn at the national level with no stratification by state.

The final Ten to Men cohort was recruited using a stratified, multi-stage, cluster random sampling design to select males aged 10-55 from the target population (Currier et al., 2016).
1.3 Ten to Men recruitment
To date, two waves of data collection have been completed for the Ten to Men study. Recruitment of eligible participants and Wave 1 of the data collection occurred between October 2013 and July 2014. Wave 2 of the data collection occurred between November 2015 and May 2016.
Fieldworkers approached 104,884 households in each of the selected Statistical Areas (SA1). They determined if there were any eligible males for inclusion in the Ten to Men study (eligibility criteria included: males aged 10-55 years, Australian permanent residents or citizens, sufficient skills in English to complete the survey).
There were 33,724 households, with 45,510 males, that were eligible for inclusion in Ten to Men at Wave 1. Initially 15,988 males aged 10-55 years were recruited into the Ten to Men study. All participants involved in Wave 1 of the study were invited to participate in Wave 2 of the study.
During Wave 2 of Ten to Men, 33 additional participants were identified for Wave 1. They were not included in the original Wave 1 dataset as their eligibility and consent status had not been determined at that stage, but this issue was resolved during Wave 2. These participants were subsequently included in Wave 1, taking the reconciled sample size for Wave 1 to 16,021.
1.4 Ten to Men cohort
The reconciled cohort sample size for Wave 1 is 16,021 males, with three age-based cohorts: boys aged 10-14 years, young men aged 15-17 years and adults aged 18-55 years.
Figure 1: Number of participants across Waves 1 and 2 by each cohort

Credit: Australian Institute of Family Studies 2019 (aifs.gov.au/copyright)
Figure 1 shows the distribution of the males between the three age-based cohorts for Wave 1. The overall retention rate of participants from Wave 1 to Wave 2 was 75%. This rate varied between the three cohorts - 74% for the adults, 71% for young men and 79% for boys.
Figure 1 also shows the movement of the participants between the age-based cohorts from Wave 1 to Wave 2. The type of questionnaire completed at Wave 2 was based on the participant's age at the time of the Wave 2 survey.

2. Ten to Men questionnaires and data collection

2. Ten to Men questionnaires and data collection
Four questionnaires were developed for the Wave 1 data collection:

  • Adult Males (18-55 years)
  • Young Men (15-17 years)
  • Boys (10-14 years)
  • Parents of the Boys (10-14 years). This questionnaire was separated into two parts:
    • Part A - information about the study boy
    • Part B - information about the household and parents.

Excluding the Boys (10-14 years), all were self-completed hard copy questionnaires. A computer assisted personal interview (CAPI) was used to collect the data for the Boys (10-14 years).
The same method was used for the Wave 2 data collection, except that the questionnaire for the Parents of the Boys was not separated into Part A and Part B.
The questionnaires for each wave are available at tentomen.org.au/data-access-and-usage/data-documentation/questionnaire-sources-and-permissions.

3. Ten to Men data

3. Ten to Men data
One of the key aims of the study is to ensure that the Ten to Men datasets are widely available for researchers to undertake analyses on male health. The datasets are large and complex, consisting of over 800 variables collected across the three cohorts for Wave 1 and Wave 2. As well as the raw data, a number of derived variables and indicator variables (questionnaire, survey date, etc.) are included.
This section provides information about the Ten to Men dataset releases, structure of the datasets, variable naming conventions, coding frameworks and confidentialisation. It is important that researchers read and understand this section.
3.1 Dataset releases
Periodically a new release of the Ten to Men datasets will be generated as additional information becomes available after each data collection wave. The releases will be numbered in sequential order and a new Digital Object Identifier (DOI) will be minted. A history of the dataset releases and suggested citations can be found in Appendix A: Dataset releases.
Release 2.1 was the most recent data release and comprised of updated Wave 1 and Wave 2 datasets. These datasets have undergone changes to previous releases, including the renaming of variables, confidentialisation and other modifications as listed in detail in the Ten to Men Data Issues Paper.
3.2 Datasets available
Releases 1.0 and 2.0 were issued by the University of Melbourne. Release 1.0 contains data from Wave 1 only, while Release 2.0 contains data from both Wave 1 and Wave 2. An additional dataset, Respondent, was also available. The Respondent dataset contained key indicator data, such as the unique study identifier, age, household identifier and geographical information.
Release 2.1 was released by AIFS and comprised of updated Wave 1 and Wave 2 datasets. Relevant data from the respondent data has been included in the dataset within the wave, and this is no longer available as a separate dataset.
A Ten to Men dataset naming convention was developed to ensure that each dataset name included information about the type of release and wave. The four datasets available for Release 2.1 are:

  • TTMGRW1 - Ten to Men, General Release, Wave 1
  • TTMRRW1 - Ten to Men, Restricted Release, Wave 1
  • TTMGRW2 - Ten to Men, General Release, Wave 2
  • TTMRRW2 - Ten to Men, Restricted Release, Wave 2.

3.3 Merging waves of data
A one-to-one merge can be used to combine the Ten to Men datasets across waves. The common variable to all the datasets that should be used for the merge is the unique study identifier (zdcid0001md).
3.4 Variable naming conventions
The Ten to Men variable naming convention was developed so that the variables have predictable names and characteristics across the waves.
This structure is presented in Figure 2. The standard format is W RD xxxxxx Q D, where:
W = Wave
RD = Research Domain
xxxxxx = Variable descriptor
Q = Questionnaire
D = Derived (optional)
Using the variable 'abpactlevud' as an example:
Wave = a (Wave 1)
Research Domain = bp (Behaviours - Physical Activity)
Variable descriptor = actlev (Activity level)
Questionnaire = u (Males under 18 years)
Derived = d (derived variable)
Figure 2: Example of variable naming convention

Credit: Australian Institute of Family Studies 2019 (aifs.gov.au/copyright)
Wave indicator
The wave indicator is the first character in the variable name and indicates the wave. For example:

  • a indicates Wave 1
  • b indicates Wave 2
  • z indicates any variable that is common across all waves (i.e. time invariant variables).

Research domain indicator
The second and third characters in the variable name represent the research domain of the corresponding question. For example, 'xbaxxxxxx' has 'ba' as the second and third characters, which represent the research domain of 'Behaviours - Alcohol'.
A list of research domains and abbreviations is provided in Table 1.
Table 1: Research domains and abbreviations

AbbreviationResearch domain

ba
Behaviours - Alcohol

bd
Behaviours - Drugs

bh
Behaviours - Help Seeking

bn
Behaviours - Nutrition

be
Behaviours - Peers

bp
Behaviours - Physical Activity

br
Behaviours - Risky and Antisocial Behaviour

bx
Behaviours - Sexual Behaviour

bs
Behaviours - Sun Exposure

bt
Behaviours - Tobacco

bw
Behaviours - Weight

dc
Data Collection

hb
Health Status - Pubertal Development

hd
Health Status - Diagnoses

hi
Health Status - Injury and Disability

hp
Health Status - Prostrate Function

hs
Health Status - Health Status

hx
Health Status - Sexual Function

hz
Health Status - Sleep

kl
Knowledge - Health Literacy

ld
Linked Data

se
Social Determinants - Socioeconomic Status

sf
Social Determinants - Family and Household Structure

sg
Social Determinants - Gender Roles and Sexuality

sh
Social Determinants - Housing and Tenure

si
Social Determinants - Interpersonal Relations

sl
Social Determinants - Life Events

st
Social Determinants - Transport

sw
Social Determinants - Work Satisfaction, Control and Security

ua
Use of Health Services - Access to Health Services

up
Use of Health Services - Satisfaction and Preferences

us
Use of Health Services - Service Use

wd
Wellbeing - Mental Health Diagnoses

wl
Wellbeing - Life Satisfaction

wp
Wellbeing - Positive Mental Health

ws
Wellbeing - Self-Injury

Variable descriptor indicator
The variable descriptor is represented by the fourth to the ninth characters in the variable name. A combination of first letter, acronyms and abbreviations is used in order to provide an abbreviation of the full variable description. Zeroes have also been used where less than six characters are required to describe the attribute. Other numerical values have been used to describe variables with similar attributes.
Questionnaire indicator
The 10th character of the variable name represents the questionnaire indicator. There are 10 possible values, with each value representing the questionnaire/s that the variable is available for. For example, ' class="small"xg' has 'g' as the 10th character, which indicates that this variable was associated with both the Young Men and Adult questionnaires.
A list of questionnaire indicators and descriptions is provided in Table 2.
Table 2: List of questionnaire indicators and descriptions

IndicatorDescription

a
Adults (18-55 years)

b
Boys (10-14 years)

e
Everyone, including the Parents of boys (10-14 years)

f
Young Men (15-17 years)

g
Young Men (15-17 years), Adults (18-55 years)

i
Field completed by interviewer

m
Everyone, excluding the Parents of boys (10-14 years)

p
Parents of boys (10-14 years)

u
Boys (10-14 years), Young Men (15-17 years)

x
Young Men (15-17 years), Adults (18-55 years), and the Parents of boys (10-14 years)

Derived indicator
The 11th character of the variable name is optional. If present, it is 'd' and indicates that the variable was derived. Some of these derived values are variables consolidated into a common measurement unit, others are a score calculated from one or more other variables.
For example, a participant may answer their height in any measurement unit and these answers have all been converted into centimetres.
3.5 Standard classifications
A number of variables in the Ten to Men datasets are coded using three standard classifications. These include details around country of birth, including parent's country of birth, main language spoken and occupation.
The information for these variables was collected using free text fields, with the responses coded using the standard classifications, as shown in Table 3. More information about these codes can be found from the Australian Bureau of Statistics (ABS) website.
Table 3: Standard classifications used

VariableClassification used

Country of Birth
Standard Australian Classification of Countries (SACC)

Main Language Spoken
Australian Standard Classification of Languages (ASCL)

Occupation
Australian and New Zealand Standard Classification of Occupations (ANZSCO)

3.6 Missing value code frame
Missing values occur when the data value is not stored for a variable and this may happen for a number of reasons. It is important to understand the reasons for the missing values as they can have a significant effect on any conclusions drawn from data.
As missing values are present in the Ten to Men data, a standard missing value code frame has been implemented. This is shown in Table 4
Table 4: Standard missing value code frame used in the data

Numeric valueFormatted valueDescription

-1
Not asked
Used when the question is not asked in the questionnaire

-2
Not applicable
Used when the question is not required due to response to preceding questions

-3
Don't know
Only used if 'Don't know' is an option listed for that question

-4
Refused or not answered
Used when an answer to a question has not been provided or suppressed due to confidentiality reasons

-5
Invalid multiple response
Multiple responses have been provided for a single response question, and the intention is unclear

-6
Value implausible
Value is implausible (as determined after data checking)

-7
Unable to determine value
Where the response provided is unclear

-8
No questionnaire or interview completed
Where a respondent does not return their questionnaire, or participate in an interview

3.7 Types of variables
The Ten to Men dataset contains the following types of variables:

  • Raw data: These are the data collected directly from questionnaire.
  • Key indicators: Each respondent has a number of assigned identification variables, such as Unique ID and Household ID.
  • Categorised: The categorised variables refer to both numerical and qualitative data. Some of the numerical data collected has been sorted into ranges, whereas the qualitative data in response to any 'Other' category have been categorised.
  • Re-classified: A small number of variables were re-classified using the relevant ABS classifications. These variables include the country of birth, languages spoken and occupations.
  • Derived: The questionnaires include sets of questions that have been designed to measure specific attributes. The results of these measures are included as derived variables.
  • Derived & categorised: These variables have been derived and then categorised into groups. This is common when the result of the derived variable is a score, and then the scores are categorised into ranges.
  • Linked data: Additional data, such as geographic and socio-economic data from the ABS, have been linked to the Ten to Men data and provided in the dataset. Further information on linked data is provided in section 4 of this Data User Guide.

3.8 Derived variables
Derived variables simplify and enhance the analytical utility of the Ten to Men dataset. At AIFS, the derived variables have been developed to maintain uniform standards and the consistent application of scales or complex calculations. Many of the derived variables are denoted by the character 'd' as the last character in the variable name.
In the Ten to Men dataset, there are three different types of derived variables:

  • variables where the responses have been converted into a comparable measurement unit
  • variables relating to branching and questionnaire-wide frequencies
  • variables created or calculated based on the responses to a set of questions.

Within the dataset, all derived variables have been positioned with the associated variables that have been used to derive them. For example, the calculation of the Body Mass Index (BMI) uses the height and weight variables, and thus the derived variable of BMI appears in the dataset directly after the height and weight variables.
Comparable measurement units
The Ten to Men dataset contains a number of measurement variables such as height, weight and time. Although these variables may be recorded in a range of different measurement units, they have all been converted to the corresponding standard unit of measurement. For example, height is in centimetres, weight is in kilograms and time is in minutes.
These variables do not have the optional 'd' as the last character in the variable name.
Branching and questionnaire-wide frequencies
The Ten to Men study asked questions about lifetime prevalence, such as 'Have you ever smoked a cigarette?' For any respondent that answered 'No' to these questions, branching logic was used to allow further questions on this topic to be skipped.
Additional questionnaire-wide variables were derived for each question where branching logic was used. This enables the questionnaire-wide current or recent use numbers to be easily obtained for those questions where only a proportion of the respondents answered due to branching logic. For example, based on responses to previous questions about smoking, only a proportion of the respondents would be asked the question, 'Have you smoked a cigarette in the last 12 months?' If this question was skipped due to branching, this would be reported as 'No' in the additional questionnaire-wide variable that would be derived.
It should be noted that questionnaire-wide does not necessarily refer to the entire Ten to Men population - it includes only respondents that received that particular questionnaire/s (Boys, Young Men and/or Adults).
These derived variables contain the text 'All respondents included' in the variable label, as well as the optional 'd' as the last character in the variable name.
Calculated variables and scale scores
The Ten to Men dataset also contains numerous scale scores to assist users in interpreting the data. Each scale score was calculated as directed by the source provided in Table 5. These variables have the optional 'd' as the last character in the variable name.
One new variable has been calculated using standard formulae and responses to questions. Details of this variable are listed below in Table 6.
Table 5: Reference of derived scale scores

MeasureReference

The Active Australia Survey
Australian Institute of Health and Welfare. (2003). The Active Australia Survey: A guide and manual for implementation, analysis and reporting. Canberra: Australian Institute of Health and Welfare.

The Alcohol Use Disorders Identification Test (AUDIT)
Babor, T. F. et al. (2001). AUDIT - The Alcohol Use Disorders Identification Test: Guidelines for Use in Primary Care (Second edition). Geneva, Switzerland: World Health Organization.

Child Special Health Care Needs Screener
The Child and Adolescent Health Measurement Initiative. (1998). The Children with Special Health Care Needs (CSHCN) Screener. Portland, OR: Department of Paediatrics, School of Medicine, Oregon Health & Science University.

Conformity to Masculine Norms Inventory (22-Item Version)
Mahalik, J. R., Locke, B. D., Ludlow, L. H., Diemer, M. A., Scott, R. P. J., Gottfried, M., & Freitas, G. (2003). Development of the Conformity of Masculine Norms Inventory. Psychology of Men & Masculinity, 4(1), 3-25.

Connor-Davidson Resilience Scale
Connor, K. M., & Davidson, J. R. (2003). Development of a new resilience scale: The Connor-Davidson Resilience Scale (CD-RISC). Depression and Anxiety, 18, 76-82 
Campbell-Sills, L., & Stein, M. B. (2007). Psychometric analysis and refinement of the Connor-Davidson resilience scale (CD_RISC): Validation of a 10-item measure of resilience. Journal of Traumatic Stress: Official Publication of the International Society for Traumatic Stress Studies, 20(6), 1019-1028.

General Help Seeking Questionnaire
Wilson, C. J., Deane, F. P., Ciarrochi, J., & Rickwood, D. (2005). Measuring help-seeking intentions: Properties of the General Help-seeking Questionnaire. Canadian Journal of Counselling, 39, 15-28.

Health Literacy Questionnaire
Osborne, R. H, Batterham, R., Elsworth, G. R., Hawkins, M., & Buchbinder, R. (2013). The Grounded psychometric development and initial validation of the Health Literacy Questionnaire (HQL). BMC Public Health, 12, 658.

Job Quality Measure
Leach, L., Butterworth, P., Rodgers, B., & Strazdins, L. (2010). Deriving an evidence-based measure of job quality from the HILDA survey. Australian Social Policy Journal, 9, 67-86.

MOS Social Support Survey
Sherbourne, C. D., & Stewart, A. L. (1991). The MOS social support survey. Social Science & Medicine, 32(6), 705-714.

PedsQL General Wellbeing Scale
Varni, J. W. (2013). Scaling and Scoring of the Paediatric Quality of Life Inventory (PedsQL) Condition - Specific Modules (Version 12). Lyon, France: MAPI Research Trust.

Patient Health Questionnaire (PHQ-9)
Kroenke, K., Spitzer, R. L., & Williams, J. B. (2001). The PHQ-9: Validation of a Brief Depression Severity Measure. Journal of General Internal Medicine, 16, 606-613. 
Richardson, L. P., McCauley, E., Grossman, D. C., McCarty, C. A., Richards, J., Russo, J. E., Rockhill, C., Katon, W. (2010). Evaluation of the Patient Health Questionnaire-9 Item for detecting major depression among adolescents. Paediatrics, 126(6), 1117-1123

Personal Wellbeing Index
The International Wellbeing Group (2006). Personal Wellbeing Index. Melbourne, Australia: Australian Centre on Quality of Life, Deakin University.

Pubertal Development Scale
Carskadon, M. A., & Acebo, C. (1993). A self-administered rating scale for pubertal development. Journal of Adolescent Health, 14(3), 190-195.

Self-determination Scale
Deci, E. L., & Ryan, R. M. (2000). The 'What' and 'Why' of Goal Pursuits: Human Needs and the Self- Determination of Behaviour. Psychological Inquiry, 11(4), 227-268.

SF12 v2.0 Health Survey
Ware, J. E., Kosinski, M., Turner-Bowker, D. M., & Gandek, B. (2002). How to Score Version 2 of the SF-12 Health Survey. Lincoln, RI: QualityMetric Incorporated.

Spence Children's Anxiety Scale
Spence, S. H. (1998). A measure of anxiety symptoms among children. Behaviour Research and Therapy, 36(5), 545-566.

Washington Group Short Set of Questions on Disability
Washington Group on Disability Statistics. (2006). Washington Group Short Set of Questions on Disability. Hyattsville, MD: National Center for Health Statistics, Centers for Disease Control and Prevention.

Table 6: Details of calculated variables

Variable labelFormula

Body Mass Index (BMI)
Weight (kg) / (Height (cm) x Height (cm))

3.9 Confidentialisation
Confidentialisation was undertaken at different levels for all Ten to Men datasets. To increase the availability of information while minimising disclosure risks, a data sharing framework to differentiate the user's access level was implemented. This resulted in two datasets for each wave being generated with different levels of confidentialisation - General Release and Restricted Release.
Restricted Release
A lower level of confidentialisation is applied to the Ten to Men Restricted Release dataset, with all initial information preserved. The only information not included in this dataset is the name, address and other contact details.
The following geospatial items have been confidentialised:

  • SA1 - identifiers were replaced with non-identifiable codes
  • SA2 - identifiers were replaced with non-identifiable codes.
  • Access to the Restricted Release datasets may only be granted where data users are able to demonstrate a genuine need for the additional data and meet the necessary additional security requirements.

General Release
The General Release dataset has undergone additional data confidentialisation in order to:

  • reduce the risk of re-identification of participants
  • ensure that AIFS meets its obligations in accordance with the National Health and Medical Research Council (NHMRC) Guidelines (Australian Code for the Responsible Conduct of Research) and the Australian Privacy Principles
  • permit the transmission of data to overseas users.

In addition to the information removed for the Restricted Release dataset, further confidentialisation for the General Release dataset includes:

  • suppressing some information
  • aggregating some response categories
  • top and/or bottom coding some variables (recoding outlying values to a less extreme value).

Confidentialisation was generally undertaken on variables with low cell counts on identifying or sensitive information. These variables include height and weight, number of children and some health conditions such as schizophrenia, conduct disorder and drug/alcohol problems.
For a complete list of confidentialised variables, users should consult the Ten to Men Data Dictionary.

4. Linked variables

4. Linked variables
Additional variables have been linked to the Ten to Men data and are provided in both the General and Restricted Release datasets. These include geographical variables that have been linked by using the residential location of the participant at the time of the survey.
4.1 State
The state of residence was added to each unit record based on the residential location of the participant at the time of the survey.
4.2 Statistical Area Levels 1 (SA1) and 2 (SA2)
The SA1 and SA2 are geographical areas in the Australian Statistical Geography Standard (ASGS) Main Structure. SA1s are the second smallest unit in the ASGS and have a population of between 200 and 800 people. SA1s aggregate to form SA2s, which have a population between 3,000 to 25,000 people. This structure is based on census data and maintained by the ABS. Detailed information about the ASGS may be sourced from the ABS website.
SA1 and SA2 information was linked to each unit record based on the residential location of the participant at the time of the survey.
For Wave 1, only confidentialised SA1s and SA2s from the 2011 Census were included in the Ten to Men datasets. For Wave 2, confidentialised SA1s and SA2s from both 2011 and 2016 Censuses were included in the Ten to Men datasets.
4.3 Socio-Economic Indexes for Areas (SEIFA)
The SEIFA is a product developed by the ABS that ranks the geographical areas of Australia according to relative socio-economic advantage and disadvantage. The SEIFA are based on information from the census, and currently consist of the following four indexes:

  • Index of Relative Socio-Economic Advantage and Disadvantage (IRSAD)
  • Index of Economic Resources (IER)
  • Index of Education and Occupation (IEO)
  • Index of Relative Socio-Economic Disadvantage (IRSD).
  • Detailed information about the SEIFA may be sourced from the ABS website.

SEIFA information was added to each unit record based on the geographical area of the participant at the time of the survey. Rank, decile and percentage were provided for each index.
For Wave 1, only SEIFA 2011 was included in the dataset. For Wave 2, both SEIFA 2011 and SEIFA 2016 were included in the datasets.
4.4 Australian Statistical Geography Standard (ASGS) Remoteness Area
The Remoteness Area is a geographical classification that defines locations in terms of remoteness. It is based on information from the census and is developed by the ABS. Detailed information about the Remoteness structure may be sourced from the ABS website.
For Wave 1, only Remoteness Area 2011 was included in the Ten to Men datasets. For Wave 2, both Remoteness Area 2011 and 2016 were included in the Ten to Men datasets.
4.5 Modified Monash Model
The Modified Monash Model (MMM) is a geographical classification developed in 2015 by the Department of Health (DoH) to address the disparities in health service access that exist across Australia. It is similar to the ASGS Remoteness classification, but further subdivides regional Australia into categories based on the size of the local town or city. The MMM uses seven categories to define locations. Detailed information about the MMM may be sourced from the DoH website.
This variable has been included in both Wave 1 and Wave 2 datasets.
4.6 Medicare Australia Data
Many of the Ten to Men participants gave consent for their data to be linked with Medicare Australia data. This includes data from the Medicare Benefits Scheme (MBS) and the Pharmaceutical Benefit Scheme (PBS) data. These additional linked data are available on request, subject to approval from the data custodian.

5. Ten to Men data user resources

5. Ten to Men data user resources
There are a number of products available to assist the user in navigating the Ten to Men dataset. These include the questionnaires, Data Dictionary, Data Books, Data Issues Paper and Technical Reports.
5.1 Data Dictionary
The Ten to Men Data Dictionary contains a detailed listing of all variables, including those that have been derived or calculated. The variables are listed in the order that they appear in the dataset, starting with Wave 1.
The Data Dictionary is available as an Excel spreadsheet and therefore the data can be easily sorted, filtered using the drop-down menus or searched according to the user's requirements.
Each row in the Excel spreadsheet describes a single variable and includes the following fields:

  • File name
  • Wave
  • Variable Position in the dataset
  • Research Domain
  • Questionnaire number in each of the four questionnaires
  • Question
  • Variable Label
  • Variable Name
  • Variable name (research domain and activity description only)
  • Format Name
  • Formatted Data values
  • Confidentialised field listing brief details of any confidential steps undertaken for the variable
  • Variable Type as to whether the variable is raw data, derived, calculated, etc.
  • General/Restricted field indicating the availability of the variable
  • Notes field indicating any other information about the variable.

5.2 Data Books
Data Books have been produced for each wave of Ten to Men using the data collected from the questionnaires. Each Data Book contains a frequency table or descriptive statistics for every variable and are useful for simple queries to particular questions.
The frequencies in the Data Books have not been weighted and, except for confidentialisation reasons, there is no editing of any data. Caution needs to be taken when interpreting results from the Data Books, especially any table with extreme values.
The Data Books are available as PDF files, and have been further separated by questionnaire. For example, Wave 1 has three Data Books available - Boys, Young Men and Adults.
5.3 Data Issues Paper
This paper provides a summary of data-related issues that have been identified over the course of Ten to Men. It has been designed to assist users of the data as they undertake research and analysis, and should be read in conjunction with the Data User Guide.
Some data-related issues that have been discussed in the Data Issues Paper include:

  • differences in sample sizes between various data releases
  • changes to the variable names
  • differences in the wording of questions across questionnaires.

Further sections will be added to the Data Issues Paper as any data-related issues emerge.
5.4 Technical Reports
The aim of the Ten to Men Technical Reports is to assist and inform users about specific types of data analysis, study methodology, survey development and any key research findings.
Six Technical Reports for Ten to Men were released by the University of Melbourne between January 2012 and January 2017.
Further Technical Reports are currently under development.

6. Statistical considerations with Ten to Men data

6. Statistical considerations with Ten to Men data
In Ten to Men, the key elements of the sample design were the use of stratification, multi-stage and cluster sampling to select prospective participants and invite them to take part in the study (Currier et al., 2016). This section describes the stratification, multi-stage design, clustering and weights used in Ten to Men. It also provides recommendations for the analysis of data that acknowledges these aspects of the design.
6.1 Stratification
Stratified sampling by remoteness of the residential location was used for the Ten to Men survey. The reference for all geographic units was the ASGS (ABS, 2011).
The ASGS classifies all locations in Australia into one of five levels of remoteness:

  • Major Cities
  • Inner Regional
  • Outer Regional
  • Remote
  • Very Remote.

The areas of Remote and Very Remote were excluded from the sampling, as it was considered too difficult to recruit and conduct interviews in these areas.
The Inner Regional and Outer Regional areas were both over-sampled. They represented 23% and 20% of the sample, whereas the population proportions were 18% and 9% (ABS, 2011).
6.2 Multi-stage design
The main structure of the ASGS comprises five levels of hierarchical spatial units, all of which aggregate progressively upwards. Mesh blocks (MB) are the smallest unit, which aggregate into Statistical Area 1s (SA1), which aggregate into SA2s, then SA3s, and then SA4s.
SA1s are the smallest unit for release of census data and generally have a population of 200-800 persons with an average of 400. Whole SA1s aggregate directly to SA2s in the ASGS structure, and do not cross state or territory borders.
SA2s are medium-sized areas, comprised of whole SA1s and represent a social/economic community unit. They usually have a population range of 3,000-25,000 persons with an average of 10,000 (ABS, 2011).
Major cities
In Major Cities, a random sample of SA1s was obtained proportional to the number of boys in the SA1. Boys were chosen based on the earlier but subsequently superseded plan to oversample young males. It was assumed that the population size of the SA1s in the Major Cities is unlikely to be strongly associated with any particular covariate profiles that might lead to lack of representation of some of the covariate distributions.
Inner and Outer Regional Areas
In the Inner and Outer Regional areas, it was decided to sample SA2s and then SA1s within the sampled SA2s. SA2s vary a lot more in size than SA1s - for example, there were 577 SA2s in the sample frame for Inner Regional, each consisting of between one and 63 SA1s. A random sample of SA2s proportional to size was chosen, where 'size' was the ABS number of boys in the SA2.
In both the Inner and Outer Regional areas, particularly because of the large variation in SA2 size, it was decided to make the sample approximately equally weighted (for individuals). This was done by choosing a fixed number of SA1s (four) in each SA2, thereby compensating for the higher probability of selection of a larger SA2. This is a standard way of producing an equally weighted sample. It means that the chance of any given SA1 in the Inner and Outer Regional areas being selected was approximately the same.
6.3 Cluster sampling
For all three strata, all eligible males in the selected household were invited to participate in Ten to Men. Thus, within a stratum, the Ten to Men study can be described as a cluster sample of eligible households, with SA1s defining the cluster.
Given that SA1s vary in population, SA1s were adjusted to distribute 65%, 20% and 15% of that initial estimated target sample across the Major Cities, Inner Regional and Outer Regional strata respectively. Having identified a target sample for each stratum, the number of SA1s required to meet those targets was determined as follows: Using 2011 Census data and response fractions of 27.1% for 10-17 year olds and 34.1% for adult males, an estimate of participants per SA1 was produced. In each Regional stratum, the participant estimate was summed down the sequentially sampled SA1 list to produce a cumulative total. Based on that total, the number of SA1s required to produce the designated regional proportion of participants per stratum was identified. For a total of 621 SA1s, this resulted in 362 SA1s in Major Cities, 144 in Inner Regional areas and 115 in Outer Regional areas. While the design of the sample did not aim to guarantee state/territory representation, the final sample did include SA1s from every state and (mainland) territory. Table 7 provides the details of the SA1 distribution by state and territory.
Table 7: Distribution of sampled SA1s by state and ASGS Regional Area
[[{"fid":"146","view_mode":"full","fields":{"format":"full","field_file_image_alt_text[und][0][value]":"Table 7: Distribution of sampled SA1s by state and ASGS Regional Area","field_file_image_title_text[und][0][value]":""},"type":"media","attributes":{"height":485,"width":725,"alt":"Table 7: Distribution of sampled SA1s by state and ASGS Regional Area","class":"media-element file-full"}}]]

6.4 Weighting
The Ten to Men study weighting can improve the relation between the inferences about males in the study and inferences about males in the population.
Sample weights can be used to adjust for potential bias due to non-response and unequal sampling fraction, and to allow inferences about population estimates to be made.
The following weights are provided in the Ten to Men datasets:

  • cross-sectional weights for each wave
  • raked cross-sectional weights for each wave
  • longitudinal weights between the waves
  • raked longitudinal weights between the waves.

Cross-sectional weights adjust the sample attained at the current wave to be representative of the population at the time of selection. These weights are most suitable for analysis that makes use only of the one wave of data.
Longitudinal weights adjust the sample that has responded to all waves of Ten to Men to be representative of the population. These weights are most suitable for analysis that makes use of data from many waves.
Raking is an iterative method used to adjust the weights based on socio-demographic characteristics that are available for both the sample and the population. It reduces the non-response bias, and also allows inferences from the sample to population to be made.
A complete list of the Ten to Men weighting variables is provided in Table 8. It is the responsibility of data users to determine when and how weighting needs to be accounted for when developing their analyses.
Table 8: Weighting variables for Ten to Men

Variable nameWavesUsed for

adcicswgtmd
1
Initial cross-sectional weight

adcrcswgtmd
1
Raked cross-sectional weight

bdcicswgtmd
2
Initial cross-sectional weight

bdcrcswgtmd
2
Raked cross-sectional weight

bdcilgwgtmd
1 & 2
Initial longitudinal weights between Wave 1 and Wave 2

bdcrlgwgtmd
1 & 2
Raked longitudinal weights between Wave 1 and Wave 2

7. Accessing Ten to Men Data

7. Accessing Ten to Men Data
The Ten to Men study has provided invaluable longitudinal data about the health of males in Australia. AIFS supports the sharing of the Ten to Men data to increase research opportunities on male health.
The data request process is managed by AIFS in a consistent and appropriate way to ensure the security of the data.
Researchers affiliated with an Australian university or recognised Australian research institute or research organisation, Australian government department or enrolled research students in Australia are eligible to access Ten to Men data. International researchers who collaborate with an eligible Australian university, organisation, institution or department can also apply for access to the data.
The following documents will aid researchers in determining whether Ten to Men data are suitable for their purposes: Ten to Men questionnaires, Data Dictionary, Data Books and the Data Issues Paper.
7.1 Security requirements for accessing Ten to Men data
The privacy of study participants is to be protected at all times. The Ten to Men Terms and Conditions of Data Access and Use detail the security requirements agreed to by users of the Ten to Men data. In summary:

  • Access is limited to those researchers named in the Data User Agreement and Deed of Confidentiality.
  • Upon receipt, the data must be stored on a password-protected computer or electronic device that is only accessible by the individual named in the Data User Agreement.
  • All electronic files, the data and copies of the data must be stored in accordance with the NHMRC Guidelines.
  • Data cannot be accessed or stored outside of Australia. This also applies to copies of the data.

7.2 Data access documentation and forms
A number of data access documents are available for prospective users of the Ten to Men data. Users should familiarise themselves with the access requirements and security protocols outlined in the following documents prior to applying for access.

  • Data Access Request Form: used by data users to apply for access to the Ten to Men data
  • Data Access Amendment Form: for currently approved users who require extension to project timelines, additional personnel, or access to additional datasets
  • Data Access Policy: outlines the procedural and technical aspects of the data access application, review and approval process
  • Terms and Conditions of Data Access and Use: details the legal terms and conditions relating to access, storage and use of the data.

These documents are available on the Ten to Men website. All data access queries should be sent to ttmdatamanager@aifs.gov.au.
7.3 Data access approval process
All users must submit a Data Access Request Form to the Ten to Men Data Manager at ttmdatamanager@aifs.gov.au. Requests are reviewed by the AIFS Data Access Review Committee. Projects must meet the public good criteria, be scientifically and ethically sound and contribute to the knowledge base on male health as described in the study aims and objectives.
Access is limited to those users named in an approved request. Approved users will be notified by the AIFS Ten to Men Data Manager and required to sign a Data User Agreement and Deed of Confidentiality. Upon receipt of the signed Agreement, AIFS will notify the Australian Data Archive (ADA) to release the requested data to the approved applicant. Access to the approved dataset(s) will be available for users to download via secure link provided by the ADA.
Further information about the data access approval process is available in the Ten to Men Data Access Policy.
7.4 Release of Ten to Men data
AIFS, in partnership with the ADA, is using Dataverse (dataverse.ada.edu.au) to facilitate access to the Ten to Men datasets. Dataverse is on online platform that enables the user to:

  • access Ten to Men datasets, once approved
  • access Ten to Men data documentation, such as the Data User Guide, Data Dictionary, questionnaires and Data Issue Paper.

The Ten to Men datasets are available free of charge for download by approved data users from the ADA in SAS, STATA and SPSS formats.
A single dataset is provided combining data from all questionnaires for each wave of data. Data from the Parent questionnaire has been included with the relevant Boy participant unit record. Other confidentialised information is available in the dataset at the unit record level, including basic demographic information, recruitment area and household identifiers, area level variables, and meta-data related to participation in each wave of data collection. Personal or identifying information of Ten to Men participants is not available.
7.5 Follow up requirements
Users must provide a written update on the progress of their project every six months and on completion of the project. Progress updates will include the stage of the project, relevant data outputs and dissemination.
Where applicable, users will provide AIFS with a copy of valuable or important syntax files (relating to derived variables) to enable replication or verification of outputs for other researchers.
Users will receive notifications from AIFS via email regarding updates/corrections to the Ten to Men data.
7.6 Destroying data
Users must notify AIFS in writing to confirm when they are finished using the Ten to Men data. This includes:

  • no longer conducting analyses with the data or producing outputs
  • no longer producing publications or reports using the data
  • no longer presenting findings using the data.

Users are then required to delete all data and any datasets, including new variables created by manipulating the data, by reformatting or rewriting the data. An End of Project form must be completed to verify this.

References

References

  • Australian Bureau of Statistics. (2011). Australian Statistical Geography Standard (ASGS): Volume 5 - Remoteness Structure. Australian Bureau of Statistics: Canberra.
  • Currier, D., Pirkis, J., Carlin, J., Degenhardt, L., Dharmage, S. C., Giles-Corti, et al. (2016). The Australian longitudinal study on male health-methods. BMC Public Health, 16(Suppl 3), 1030. DOI: 10.1186/s12889-016-3698-1.
  • Kalton, G. (1983). Introduction to Survey Sampling. Newburry Park, CA: Sage Publications.
  • Spittal, M. J., Carlin J. B., Currier, D., Downes, M., English, D. R., Gordon, I. et al. (2016). The Australian longitudinal study on male health sampling design and weighting: Implications for analysis and interpretation of clustered data. BMC Public Health, 16(Suppl 3), 1062. DOI: 10.1186/s12889-016-3699-0.
Appendix A: Dataset releases

Appendix A: Dataset releases
Table showing dataset releases

DateReleaseDatasetSuggested citation and DOI

July 2016
Release 1.0
Wave 1
Pirkis, J., English, D., & Currier, D. (2016). The Australian Longitudinal Study on Male Health (Ten to Men), 2013 [computer file]. Canberra: Australian Data Archive, The Australian National University, 2016. 

  • DOI:10.4225/87/587ebdbc851b1

August 2017
Release 2.0
Respondent 
Wave 1
Wave 2

Pirkis, J., English, D., & Currier, D. (2017). The Australian Longitudinal Study on Male Health (Ten to Men), 2013 [computer file]. Canberra: Australian Data Archive, The Australian National University, 2017. 

  • Respondent DOI: 10.4225/87/N8C9NP
  • Wave 1 DOI: 10.4225/87/Z4PEZN
  • Wave 2 DOI: 10.4225/87/2KHTSV

September 2019
Release 2.1
Wave 1 
Wave 2

Bandara, D., Howell, L., & Daraganova, G. (2019). Ten to Men: The Australian Longitudinal Study on Male Health - Data Release 2.1, September 2019. Melbourne: Australian Institute of Family Studies. 

  • DOI: 10.26193/V2IVIG
Glossary

Glossary
Glossary of terms

TermDescription

ABS
Australian Bureau of Statistics

ADA
Australian Data Archive

AIFS
Australian Institute of Family Studies

ANZSCO
Australian and New Zealand Standard Classification of Occupations

ASCL
Australian Standard Classification of Languages

ASGS
Australian Statistical Geographic Standards

CAPI
Computer-Assisted Personal Interview

Data Books
Documents providing the frequency distribution of variables by questionnaire

Data Dictionary
Spreadsheet listing all variables and their attributes

DLIA
Data Linkage and Integrating Authority

DoH
Department of Health

DOI
Digital Object Identifier

MB
Mesh Block

MMM
Modified Monash Model

NHMRC
National Health and Medical Research Council

RA
Remoteness Area

RMR
Roy Morgan Research (fieldwork agency commissioned to carry out Waves 1 and 2 fieldwork)

SA1
Statistical Area 1

SA2
Statistical Area 2

SACC
Standard Australian Classification of Countries

SEIFA
Socio-Economic Indexes for Areas

TTM
Ten to Men study

UoM
University of Melbourne

Acknowledgements

Ten to Men: The Australian Longitudinal Study on Male Health was commissioned by the Commonwealth Department of Health. The study was initially conducted by the University of Melbourne who released datasets, including data documentation, for Wave 1 and Wave 2. Roy Morgan Research undertook the data collection and initial data processing for these two waves.

After a competitive tender process in 2017, the Australian Institute of Family Studies (AIFS) was awarded with the tender to conduct Wave 3. Since then, the Wave 1 and Wave 2 datasets, including data documentation, have been updated by AIFS.

Publication details

Data User Guide
Published by the Australian Institute of Family Studies, September 2019
Suggested citation:

Bandara, D., Howell, L., Silbert, M., Mohal, J., Garrard, B., & Daraganova, G. (2019). Ten to Men: The Australian Longitudinal Study on Male Health - Data User Guide, Version 3.0, September 2019. Melbourne: Australian Institute of Family Studies.

Download Publication