List of columns of the table MD_NonDegreeStudent

Name

Comment

Domain

Data Type

Length

MD_Pers_EnterpriseUniqueID

Unique "person" ID assigned to everyone connected with Carleton. Maintained by the identity management system. Should be mandatory, but we cannot make it so until the IdM system is complete and fully functioning - and we can call out to it in our ETL code.

This attribute is currently not populated.

DW_EnterpriseUniqueID

char(8)

8

MD_Pers_CommonName

Full legal name, with middle name or initial. Does not include honorifics unless these are part of the legal name and are available in the system from which the data is derived. This field is mandatory and has no default.

Note that this field is 'calculated' for prospective students (i.e., put together out of the given name, middle name, and surname). Because the middle name is not well maintained for prospective students, the common name will be "dirty" as well. It is recommended that this field be avoided, therefore, for prospective students. Also, note: If you believe names are needed for prospective students, then consider also whether the report you are creating should be generated off of Recruitment Plus, and not the data warehouse.

Common names in Colleague should be very clean, and as a result common names in the data warehouse for students, faculty, and staff may be relied upon. If you need names for a report, however, consider reporting directly off of Colleague. Typically the data warehouse is best for aggregate analysis, and if you are using individual names, you may find other sources of information to be better.

Common names in Advance are also relatively clean. Hence also are common names in the warehouse for alumni and other constituencies maintained by External Relations.

DW_CommonName

nvarchar(64)

64

MD_Pers_Surname

Surname in the US means the same thing as "last name." Does not include suffixes like Jr., or honorifics.

This attribute is mandatory and has no default.

DW_Surname

nvarchar(32)

32

MD_Pers_GivenName

Given name in the US typically means the same thing as "first name." Default value is "missing".

DW_GivenName

nvarchar(32)

32

MD_Pers_MiddleName

Middle name or initial (if available). This field is fairly "dirty" and poorly maintained in some of our systems, especially in Recruitment Plus (not because our staff is doing a poor job, but because they sources from which they obtain their data are of uneven quality). This field should be avoided, therefore, if possible, particularly for prospective student data.

The default value is "missing".

DW_GivenName

nvarchar(32)

32

MD_Pers_Gender

Must be Male, Female, or Unknown.

DW_Gender

varchar(7)

7

MD_Pers_PrimaryAffiliation

Main relationship to Carleton. Values should be taken from the identity management system, if available. Defaults to 'Unknown'. Values are taken from a controlled vocabulary including: Student, Faculty, Staff, Alum, Parent, Friend.

Where populated, this attribute is an excellent measure of general affiliation. It is not, however, populated for all 'people' dimensions.

DW_CarlAffiliation

varchar(16)

16

MD_Pers_Citizenship

A country name, not code. It is not guaranteed that all values for this attribute will be consistent across all dimensions. For example, citizenship for a prospective student in MD_Prospect (populated from Recruitment Plus) may be "Czek Republic," while citizenship for that same person in MD_DegreeStudent may be "Czekoslovakia". It would be nice to have these values all be consistent across the campus, but doing so would constitute a project in and of itself. Defaults to 'Unknown'.

DW_Addr_Country

nvarchar(64)

64

MD_Pers_AgeRange

Age of person, discretized into a set of ranges useful for reporting and demographic classification. If age is unknown, the value of this attribute will be "Unknown."

Note that the age is calculated from the birth date, and does not take death into account. The age range, therefore, indicates the age if the person were alive (and in most cases they are).

DW_AgeRange

varchar(12)

12

MD_Pers_HomeAddrCity

City of home address. E.g., if my home address is 203 Woodley Street West, Northfield, MN 55057 USA, then my city is 'Northfield'. If my home address is unavailable, and all we have is a mailing address, we use the mailing address (in preference to nothing at all) because, presumably, I receive mail somewhere near where I live, or in a place I spend a lot of time.

Note that this field often contains extra information if the city is outside the US, so for non-US cities YMMV, i.e., you may not get the results you're looking for. For example, the postal code for foreign addresses may be coded after the city. The country may also be tacked on.

This field is therefore not suitable for serious analysis.

DW_Addr_City

nvarchar(32)

32

MD_Pers_HomeAddrUSState

State of home address. E.g., if my home address is 203 Woodley Street West, Northfield, MN 55057 USA, then my state is 'Minnesota'. If I live outside the US, this attribute will be 'Not Applicable'. If no home address is available for me, the value will be 'Unknown'. If my home address is unavailable, and all we have is a mailing address, we use the mailing address (in preference to nothing at all) because, presumably, I receive mail somewhere near where I live, or in a place where I spend a lot of time.

Armed forces 'states' (really postal routing codes) as of 2012 still are not expanded. E.g., if someone's address is 'APO AA' (Army Post Office, Armed Forces Americas), the city will be 'Apo' and the state will be 'AA' as of 2012. In other words, military postal codes of AA, AE, AP will not be expanded to a longer name as of 2012. This leads to awkward 'state' lists in some reports, where you see, e.g., AA, AE, and AP alongside Pennsylvania, Illinois, Rhode Island, Minnesota, etc.

DW_Addr_USState

varchar(24)

24

MD_Pers_HomeAddr5DigitUSZip

5-digit US zipcode. Five digits (and not nine) because this attribute is intended for aggregate spatial analysis, i.e., for grouping people and things into geographical units. It's not meant to identify individual people or addresses, which is not something one typically does in a data warehouse. If you need details about where someone lives, exactly, use Microsoft Access to join data here to data in the appropriate transactional system (e.g., Recruitment Plus, Colleague, or Advance). Defaults to '0000'.

DW_Addr_5DigitUSZipcode

char(5)

5

MD_Pers_HomeAddrUSRegionFIPS

Contains the FIPS US region (e.g., "Middle Atlantic" or "New England"). Defaults to "Unknown." Used for various reporting. Compare the USRegionCarleton, which is also used for institutional reporting. Everyone is encouraged to use one of these two systems, if at all possible - for consistency's sake.

FIPS codes, in general, are a set of standardized numeric or alphabetic codes issued by the US National Institute of Standards and Technology (NIST) to ensure uniform identification of geographic entities throughout all federal government agencies.

DW_USRegion

varchar(64)

64

MD_Pers_HomeAddrUSRegionCarleton

Contains the US region (e.g., "Middle States" or "New England"). Defaults to "Unknown." Used for various Carleton reports. Compare the USRegionFIPS, which is also used for various reports. Everyone is encouraged to use one of these two systems, if at all possible - for consistency's sake.

DW_USRegion

varchar(64)

64

MD_Pers_HomeAddrCountry

DO NOT USE THIS FIELD FOR SERIOUS REPORTING. The data is entered for operational/addressing purposes, and is not of sufficiently high quality to work for analysis.

In Colleague, at least as of 2011, international addresses are not handled well, and therefore the country may not be well or correctly populated. Colleague, for example (again, as of 2011), has no place for a foreign postal code, so often people place that, along with the country, in the space that's supposed to house the city. This makes things print correctly, but it means the country will not be populated. And since the data warehouse defaults to 'USA' if this field is blank (which is generally appropriate, across most systems), this means that many international addresses end up in the warehouse with a country of the United States.

In general, therefore, it is wise to use MD_Pers_Citizenship instead of MD_Pers_HomeAddrCountry, although the challenge there is that citizenship is designated in different ways across systems (not necessarily with consistent ISO country names or codes).

Note that country data for prospective students, especially ones who are only 'suspects' (early in the admissions cycle - not applicants or admits) may not be wholly reliable, because we obtain their addresses from third parties who don't consistently mark the country.

DW_Addr_Country

nvarchar(64)

64

MD_Pers_HomeAddrLatitude

Latitude of home address, as a floating point number. Best populated for US addresses, and populated in a very coarse fashion for those (using the zipcode) before late summer 2011.

Warning: Note that there is no default, and the value here may be NULL.

Values for this attribute from before late summer 2011 are not truly fit for user consumption, except by special request, and should not (yet) be exposed via views.

Note that the longitude and latitude for a given address may change over time, without any change in the address, because we occasionally change geocoding providers. The geocoding providers are also continually improving their algorithms. If a longitude or latitude changes, without any address change, a new row is generated in the data warehouse. But history is not rewritten. The earlier, presumably less accurate figure remains in non-current rows.


<None>

float

 

MD_Pers_HomeAddrLongitude

Longitude of home address, as a floating point number (negative = west longitude). Best populated for US addresses, and populated in a very coarse fashion for those (using the zipcode) before late summer 2011.

Warning: Note that there is no default, and the value here may be NULL.

Values for this attribute from before late summer 2011 are not truly fit for user consumption, except by special request, and should not (yet) be exposed via views.

Note that the longitude and latitude for a given address may change over time, without any change in the address, because we occasionally change geocoding providers. The geocoding providers are also continually improving their algorithms. If a longitude or latitude changes, without any address change, a new row is generated in the data warehouse. But history is not rewritten. The earlier, presumably less accurate figure remains in non-current rows.

<None>

float

 

MD_Pers_HomeAddrMilesFromCampus

Distance in miles from Carleton main campus. Populated using the home address latitude and longitude, and only for US addresses with a zipcode. The value here has poor resolution and should not be used except for very general groupings (nothing with a resolution less than twenty miles). Eventually we may license a geocoding service, and populate this attribute well. Until then, YMMV.

This attribute is not 'fit' for user consumption, except by special request, and should not (yet) be exposed via views.

Warning: Note that there is no default, and the value here may be NULL.

<None>

float

 

MD_Pers_IsNonResidentAlien

"Yes" if the person in question is not a citizen or national of the United States and who is in the United States on a visa or temporary basis and does not have the right to remain indefinitely. "No" otherwise.

IPEDS does not require us to report the ethnicity/race of nonresident aliens, so (although we may collect race/ethnicity and alien status independently) when reporting to IPEDS, "Non-Resident Alien" becomes like a race, i.e., you're white, black, etc., or a nonresident alien. To do IPEDS reporting, most people-based views in the data warehouse have an IPEDS ethnicity attribute that includes nonresident alien as a race. Use that attribute for IPEDS reporting.

Use this attribute if you simply want to divide a population up into two piles, one for nonresident alieans and another for other people.

DW_YesNoOrUnknown

varchar(7)

7

MD_Pers_IsOfUnknownRaceEthnicity

"Yes" if neither the race nor ethnicity is known. "No" otherwise. Calculated from Race and IsHispanicOrLatino, as well as from OLDPrimaryRaceList. If the old race list is empty or 'Unknown' and if Race is 'Unknown', and IsHispanicOrLatino = 'Unknown' or 'No', then this attribute will be 'No'. It basically means we don't know enough to classify someone by ethnicity or race, in either the old or the new 2010 census system.

DW_YesOrNoNOTNULL

char(3)

3

MD_Pers_IsHispanicOrLatino

Census requires that we ask first if a person is Hispanic, Latino, or of Spanish origin. This is the 'ethnicity' question. We may then ask a 'race' question. If the person in question lists Hispanic ethicity (or in the 'old' pre-2010 system) lists Hispanic as their 'race' then this attribute will be 'Yes'. If not, then 'no'. If no information is available, then 'Unknown'.

Note that some systems (e.g., Colleague) store only one race/ethnicity. Recall also that before 2010 Hispanic was considered a race. Hence if someone considers themselves primarily African American, but secondarily Hispanic, the Hispanic ethnicity will not be stored in Colleague (at least before the 2010 census standards kicked in). And as a result, this attribute will, incorrectly, be registered as 'No' in some cases.

DW_YesNoOrUnknown

varchar(7)

7

MD_Pers_Race

Holds the person's race. This attribute can be treated as "primary race" although the way this information is collected makes the two not entirely equivalent, since in some cases Carleton may choose to collect fuller information on people of multiple races.

This attribute (as opposed to MD_Pers_OldPrimaryRace) does not tell us whether a person is of Hispanic, Latino, or Spanish origin.

Note that in the new ethnicity/race classification system (2010 census and beyond), multi-racial people cannot indicate what races they belong to. They are simply multi-racial. Also, Hispanic is not considered a race, as noted. It's an ethnicity.

Hence this attribute will always have a single value, one of:

African American
American Indian-Alaska Native
Asian
Hawaiian-Pacific Islander
Two or More Races
Unknown
White

This attribute is not populated before 2009, and is unevently populated until 2011 (as of 2011, employees have not been fully resurveyed).

Note that some systems store only one race/ethnicity. Others do not. The data here will therefore be of uneven completeness and consistency. When reporting, stick with a single system of record, or a single constituency (like 'prospective students') and you'll be fine. Otherwise, be extremely careful, and be sure to work with people who administer the source systems who can advise you.

DW_RaceForCensus2010

varchar(64)

64

MD_Pers_RaceList

Comma-separated list of all races a person claims to belong to, ordered (by preference? alphabetically? by an obscure "trumping" system we concoct?).

Use this attribute as a filter, to select out people by racial affiliation (e.g., WHERE MD_Pers_OLDRaceList LIKE '%African%').

Note that this attribute is populated using information that the US government doesn't require us to collect. According to 2010 census standards, a multi-racial person is just multi-racial (i.e., "multi-racial" is their race). No further information is collected. Also, Hispanic is not a race.

This attribute is currently not populated, and will not be until Carleton implements the 2010 census ethnicity/race classification system across its enterprise systems.

DW_CommaDelimitedList

varchar(128)

128

MD_Pers_OLDPrimaryRace

Until the census-2010 race/ethnicity system is fully implemented, this is our main "race" attribute. It should eventually become, hence the designation "old." It will still be populated, however, even after we have converted to the new census 2010 system, so that we can report easily across multiple years, both before and after the transition. And it is the only attribute you can reliably use to report on ethnicity data, longitudinally, before and after the transition (which began in late 2009).

Possible values of this attribute include:

Asian
Black or African American
Hispanic
Mixed race or other
Native American-Alaskan Native
Unknown
White

The primary race is not necessarily selected by the person reflected in the data. For students it is inferred using a 'trumping' system that's complex and that most folks don't understand fully (e.g., if you're white and African American, the latter trumps the former and becomes your primary ethnicity). In this old system, 'Hispanic' was considered a race. Or, conversely, everything was an ethnicity. In the new 2010 census system, 'Hispanic' is an ethnicity, and most everything else ('Asian', 'White', etc.) is a race.

When reporting, stick with a single system of record, or a single constituency (like prospective students or degree students) and you'll be fine. Otherwise, be extremely careful, and be sure to work with people who administer the source systems who can advise you.

Note also that, as per institutional standards, ethnicities like 'Unknown' are not automatically counted as white, as sometimes happens in reports generated, e.g., by Student Financial Services and other offices.

DW_CodeDescLong

varchar(64)

64

MD_Pers_OLDRaceList

Comma-separated list of races a person claims to belong to, e.g., "Asian, White". Follows the order of races provided by the source system, which may or may not be ordered. Defaults to the empty string. Note that this attribute will still be populated, even after we have converted to the new census-2010 race/ethnicity system, so that we can report easily across multiple years, both before and after the transition.

Use this attribute as a filter, to select out people by racial affiliation (e.g., WHERE MD_Pers_OLDRaceList LIKE '%African%').

This attribute uses the "old" coding standards where the primary race is determined by a "trumping" system we utilize (it's not based entirely on what the person tells us). It also treats Hispanic as a race, which the 2010 census standards have changed (Hispanic is now an ethnicity; a Hispanic person can be of any race).

DW_CommaDelimitedList

varchar(128)

128

MD_Pers_Disability1

First of max 3 possible disabilities we track. Used for EOE purposes. Feds will be asking us to track this for students. Possibly needed for alumni and admissions statistics/reporting as well.

This attribute is currently, as of 2009, not populated.

DW_Disability

varchar(32)

32

MD_Pers_Disability2

 

DW_Disability

varchar(32)

32

MD_Pers_Disability3

 

DW_Disability

varchar(32)

32

MD_Pers_DisabilityList

Comma-separated list of all disabilities. Feds will be asking us to track this for staff and students.

This attribute is currently, as of 2009, not populated.

DW_CommaDelimitedList

varchar(128)

128

MD_Stu_AcadLevel

Colleague academic level names, Undergraduate (UG), Summer Graduate (SG), and Non-Degree (ND). We use the description here, not the 'code,' for human readability. If a student has graduated, then the value here will be 'Not applicable.' Ditto for students who have received aid offers, etc., but who have not matriculated.

DW_AcademicLevel

varchar(30)

30

MD_NonDegStu_Type

Non-degree students come in various flavors, including St Olaf students, High School students, etc. This field records the flavor.

DW_StudentType

char(30)

30

MD_NonDegStu_AuditKey

 

DW_AuditKey

bigint

 

MD_NonDegStu_RowStart

 

DW_RowStart

datetime

 

MD_NonDegStu_RowEnd

 

DW_RowEnd

datetime

 

MD_NonDegStu_RowIsCurrent

 

DW_BooleanNoDefault

bit

 

MD_NonDegStu_Type1Checksum

 

DW_Checksum

int

 

MD_NonDegStu_Type2Checksum

 

DW_Checksum

int

 

MD_NonDegStu_Timestamp

 

<None>

timestamp

 


1  2