|
Name |
|
|
Dimensional Type |
Dimension |
|
Comment |
Includes all students with an academic level code of UG, which means degree seekers (and since Carleton grants only bachelor's degrees, the UG implies people looking for a bachelor's degree). As with all 'people' dimensions, this dimension is heavy with type II SCDs, which means that it keeps history. Unfortunately, it only keeps history back to the point when the data warehouse was first deployed for degree students (late 2009). What this means, in practice, is that if you are looking at facts and fact tables with data from before late 2009, any rows that join to this dimension will join to the closest available row. Hence, as you go back in time from 2009, type-II SCDs in this dimension (like student enrollment status) will become less and less accurate. That is, they will reflect the student's status as of 2009 and not at the time that the fact in question was recorded. |
|
Description! |
|
Annotation! |
|
CKT_MD_DEGREESTUDENT |
|
Name |
Child Table |
Foreign Key Columns |
|
AssetValueToDegreeStudent |
MD_CollPers_SurrogateKeyDStu |
|
|
DStuEnrolledForRegularTerm |
MD_CollPers_SurrogateKey |
|
|
DStuReceivedGradeForRegularTerm |
MD_CollPers_SurrogateKey |
|
|
Relationship_17 |
MD_CollPers_SurrogateKey |
|
|
Relationship_35 |
MD_CollPers_SurrogateKey |
|
|
Relationship_37 |
MD_CollPers_SurrogateKey |
|
|
Relationship_83 |
MD_CollPers_SurrogateKey |
|
|
Relationship_85 |
MD_CollPers_SurrogateKeyDStu |
|
|
Relationship_88 |
MD_CollPers_SurrogateKeyDStu |
|
|
Relationship_91 |
MD_CollPers_SurrogateKey |
|
|
Relationship_122 |
MD_CollPers_SurrogateKeyDStu |
|
|
Relationship_145 |
MD_CollPers_SurrogateKeyDStu |
|
|
Relationship_170 |
MD_CollPers_SurrogateKey |
|
|
Relationship_171 |
MD_CollPers_SurrogateKeyDStu |
|
|
Relationship_171 |
MD_CollPers_SurrogateKey |
|
|
Relationship_173 |
MD_CollPers_SurrogateKeyDStu |
|
|
Relationship_180 |
MD_CollPers_SurrogateKeyDStu |
|
|
Relationship_235 |
MD_CollPers_SurrogateKeyEmp |
|
Name |
Code |
|
Name |
|
Name |
Comment |
Domain |
Data Type |
Length |
|
MD_CollPers_SurrogateKey |
|
<None> |
numeric |
|
|
MD_Pers_AlienStatus |
As opposed to MD_Pers_IsNonResidentAlien, which simply classifies a person as a nonresident alien or not (and is normally more than sufficient for reporting), this field houses detailed alien status information. |
DW_CodeDesc |
varchar(32) |
32 |
|
MD_DStuRowStartYear |
Used for fast retrieval of information; do NOT expose to users. |
<None> |
int |
|
|
MD_ColPers_ColleagueID |
Unique identifier in Colleague. Typically not exposed in reports. Used for internal joins and views within the data warehouse. If you think you need to print and analyze this attribute, think again - or check with the data warehouse team. Consider using Application ID instead. |
DW_ColleagueID |
char(7) |
7 |
|
MD_Pers_EnterpriseUniqueID |
Unique "person" ID assigned to everyone connected with Carleton. Maintained by the identity management system. Should be mandatory, but we cannot make it so until the IdM system is complete and fully functioning - and we can call out to it in our ETL code. |
DW_EnterpriseUniqueID |
char(8) |
8 |
|
MD_Pers_CommonName |
Full legal name, with middle name or initial. Does not include honorifics unless these are part of the legal name and are available in the system from which the data is derived. This field is mandatory and has no default. |
DW_CommonName |
nvarchar(64) |
64 |
|
MD_Pers_Surname |
Surname in the US means the same thing as "last name." Does not include suffixes like Jr., or honorifics. |
DW_Surname |
nvarchar(32) |
32 |
|
MD_Pers_GivenName |
Given name in the US typically means the same thing as "first name." Default value is "missing". |
DW_GivenName |
nvarchar(32) |
32 |
|
MD_Pers_MiddleName |
Middle name or initial (if available). This field is fairly "dirty" and poorly maintained in some of our systems, especially in Recruitment Plus (not because our staff is doing a poor job, but because they sources from which they obtain their data are of uneven quality). This field should be avoided, therefore, if possible, particularly for prospective student data. |
DW_GivenName |
nvarchar(32) |
32 |
|
MD_Pers_Gender |
Must be Male, Female, or Unknown. |
DW_Gender |
varchar(7) |
7 |
|
MD_Pers_PrimaryAffiliation |
Main relationship to Carleton. Values should be taken from the identity management system, if available. Defaults to 'Unknown'. Values are taken from a controlled vocabulary including: Student, Faculty, Staff, Alum, Parent, Friend. |
DW_CarlAffiliation |
varchar(16) |
16 |
|
MD_Pers_Citizenship |
A country name, not code. It is not guaranteed that all values for this attribute will be consistent across all dimensions. For example, citizenship for a prospective student in MD_Prospect (populated from Recruitment Plus) may be "Czek Republic," while citizenship for that same person in MD_DegreeStudent may be "Czekoslovakia". It would be nice to have these values all be consistent across the campus, but doing so would constitute a project in and of itself. Defaults to 'Unknown'. |
DW_Addr_Country |
nvarchar(64) |
64 |
|
MD_Pers_AgeRange |
Age of person, discretized into a set of ranges useful for reporting and demographic classification. If age is unknown, the value of this attribute will be "Unknown." |
DW_AgeRange |
varchar(12) |
12 |
|
MD_Pers_HomeAddrCity |
City of home address. E.g., if my home address is 203 Woodley Street West, Northfield, MN 55057 USA, then my city is 'Northfield'. If my home address is unavailable, and all we have is a mailing address, we use the mailing address (in preference to nothing at all) because, presumably, I receive mail somewhere near where I live, or in a place I spend a lot of time. |
DW_Addr_City |
nvarchar(32) |
32 |
|
MD_Pers_HomeAddrNonUSProvince |
|
DW_Addr_StateOrProvince |
nvarchar(64) |
64 |
|
MD_Pers_HomeAddrUSState |
State of home address. E.g., if my home address is 203 Woodley Street West, Northfield, MN 55057 USA, then my state is 'Minnesota'. If I live outside the US, this attribute will be 'Not Applicable'. If no home address is available for me, the value will be 'Unknown'. If my home address is unavailable, and all we have is a mailing address, we use the mailing address (in preference to nothing at all) because, presumably, I receive mail somewhere near where I live, or in a place where I spend a lot of time. |
DW_Addr_USState |
varchar(24) |
24 |
|
MD_Pers_HomeAddr5DigitUSZip |
5-digit US zipcode. Five digits (and not nine) because this attribute is intended for aggregate spatial analysis, i.e., for grouping people and things into geographical units. It's not meant to identify individual people or addresses, which is not something one typically does in a data warehouse. If you need details about where someone lives, exactly, use Microsoft Access to join data here to data in the appropriate transactional system (e.g., Recruitment Plus, Colleague, or Advance). Defaults to '00000'. |
DW_Addr_5DigitUSZipcode |
char(5) |
5 |
|
MD_Pers_HomeAddrUSRegionFIPS |
Contains the FIPS US region (e.g., "Middle Atlantic" or "New England"). Defaults to "Unknown." Used for various reporting. Compare the USRegionCarleton, which is also used for institutional reporting. Everyone is encouraged to use one of these two systems, if at all possible - for consistency's sake. |
DW_USRegion |
varchar(64) |
64 |
|
MD_Pers_HomeAddrUSRegionCarleton |
Contains the US region (e.g., "Middle States" or "New England"). Defaults to "Unknown." Used for various Carleton reports. Compare the USRegionFIPS, which is also used for various reports. Everyone is encouraged to use one of these two systems, if at all possible - for consistency's sake. |
DW_USRegion |
varchar(64) |
64 |
|
MD_Pers_HomeAddrCountry |
DO NOT USE THIS FIELD FOR SERIOUS REPORTING. The data is entered for operational/addressing purposes, and is not of sufficiently high quality to work for analysis. |
DW_Addr_Country |
nvarchar(64) |
64 |
|
MD_Pers_HomeAddrLatitude |
Latitude of home address, as a floating point number. Best populated for US addresses, and populated in a very coarse fashion for those (using the zipcode) before late summer 2011. |
<None> |
float |
|
|
MD_Pers_HomeAddrLongitude |
Longitude of home address, as a floating point number (negative = west longitude). Best populated for US addresses, and populated in a very coarse fashion for those (using the zipcode) before late summer 2011. |
<None> |
float |
|
|
MD_Pers_HomeAddrMilesFromCampus |
Distance in miles from Carleton main campus. Populated using the home address latitude and longitude, and only for US addresses with a zipcode. The value here has poor resolution and should not be used except for very general groupings (nothing with a resolution less than twenty miles). Eventually we may license a geocoding service, and populate this attribute well. Until then, YMMV. |
<None> |
float |
|
|
MD_Pers_IsNonResidentAlien |
"Yes" if the person in question is not a citizen or national of the United States and who is in the United States on a visa or temporary basis and does not have the right to remain indefinitely. "No" otherwise. |
DW_YesNoOrUnknown |
varchar(7) |
7 |
|
MD_Pers_IsOfUnknownRaceEthnicity |
"Yes" if neither the race nor ethnicity is known. "No" otherwise. Calculated from Race and IsHispanicOrLatino, as well as from OLDPrimaryRaceList. If the old race list is empty or 'Unknown' and if Race is 'Unknown', and IsHispanicOrLatino = 'Unknown' or 'No', then this attribute will be 'No'. It basically means we don't know enough to classify someone by ethnicity or race, in either the old or the new 2010 census system. |
DW_YesOrNoNOTNULL |
char(3) |
3 |
|
MD_Pers_IsHispanicOrLatino |
Census requires that we ask first if a person is Hispanic, Latino, or of Spanish origin. This is the 'ethnicity' question. We may then ask a 'race' question. If the person in question lists Hispanic ethicity (or in the 'old' pre-2010 system) lists Hispanic as their 'race' then this attribute will be 'Yes'. If not, then 'no'. If no information is available, then 'Unknown'. |
DW_YesNoOrUnknown |
varchar(7) |
7 |
|
MD_Pers_Race |
Holds the person's race. This attribute can be treated as "primary race" although the way this information is collected makes the two not entirely equivalent, since in some cases Carleton may choose to collect fuller information on people of multiple races. |
DW_RaceForCensus2010 |
varchar(64) |
64 |
|
MD_Pers_RaceList |
Comma-separated list of all races a person claims to belong to, ordered (by preference? alphabetically? by an obscure "trumping" system we concoct?). |
DW_CommaDelimitedList |
varchar(128) |
128 |
|
MD_Pers_OLDPrimaryRace |
Until the census-2010 race/ethnicity system is fully implemented, this is our main "race" attribute. It should eventually become, hence the designation "old." It will still be populated, however, even after we have converted to the new census 2010 system, so that we can report easily across multiple years, both before and after the transition. And it is the only attribute you can reliably use to report on ethnicity data, longitudinally, before and after the transition (which began in late 2009). |
DW_CodeDescLong |
varchar(64) |
64 |
|
MD_Pers_OLDRaceList |
Comma-separated list of races a person claims to belong to, e.g., "Asian, White". Follows the order of races provided by the source system, which may or may not be ordered. Defaults to the empty string. Note that this attribute will still be populated, even after we have converted to the new census-2010 race/ethnicity system, so that we can report easily across multiple years, both before and after the transition. |
DW_CommaDelimitedList |
varchar(128) |
128 |
|
MD_Pers_Disability1 |
First of max 3 possible disabilities we track. Used for EOE purposes. Feds will be asking us to track this for students. Possibly needed for alumni and admissions statistics/reporting as well. |
DW_Disability |
varchar(32) |
32 |
|
MD_Pers_Disability2 |
|
DW_Disability |
varchar(32) |
32 |
|
MD_Pers_Disability3 |
|
DW_Disability |
varchar(32) |
32 |
|
MD_Pers_DisabilityList |
Comma-separated list of all disabilities. Feds will be asking us to track this for staff and students. |
DW_CommaDelimitedList |
varchar(128) |
128 |
|
MD_Pers_IsVeteran |
"Yes" if a person is a veteran by federal government reporting standards, particularly EOE, for employees. "No" otherwise. Although the database also allows "Unknown" here, this value is not used. |
DW_YesNoOrUnknown |
varchar(7) |
7 |
|
MD_Stu_AcadLevel |
Colleague academic level names, Undergraduate (UG), Summer Graduate (SG), and Non-Degree (ND). We use the description here, not the 'code,' for human readability. If a student has graduated, then the value here will be 'Not applicable.' Ditto for students who have received aid offers, etc., but who have not matriculated. |
DW_AcademicLevel |
varchar(30) |
30 |
|
MD_DStu_RecruitmentPlusPKID |
Recruitment Plus PKID used for joins to prospect/Admissions tables. This attribute may be NULL and therefore should not be exposed to most users. Do not use this attribute in typical reports or queries. This attribute is an integer (as in Recruitment Plus). |
DW_RecruitmentPlusPKID |
int |
|
|
MD_DStu_AdvanceID |
Advance ID used for joins to alumni tables. This attribute may be NULL and therefore should not be exposed to most users. Do not use this attribute in typical reports or queries. This attribute is a ten-character, zero-left-padded string of digits, e.g., 0000012481. |
DW_AdvanceID |
char(10) |
10 |
|
MD_DStu_IsFirstTimeCollege |
"Yes" if student is a first-time student. Some reports require us to select out only first-time full-time students who started as freshmen. |
DW_YesOrNoNOTNULL |
char(3) |
3 |
|
MD_DStu_EnteredAsTransfer |
"Yes" if a given student entered as a transfer student. "No" otherwise. This attribute is a type 1 SCD, which means that even though it indicates how a student entered Carleton, it is applied to all RowStart/RowEnd times. In other words, once a "yes," always a "yes" here, unless the student's program changes fundamentally (e.g., leaves and is readmitted). |
DW_YesOrNoNOTNULL |
char(3) |
3 |
|
MD_DStu_EnteredAsFullTime |
"Yes" if a given student entered as a full-time student. "No" otherwise. |
DW_YesNoOrUnknown |
varchar(7) |
7 |
|
MD_DStu_LegacyType |
This attribute indicates whether a student is a first-generation college student or a legacy of some type. Only a single value is stored here (so a person can't be a first-generation college student, and a parent-legacy). What gets stored here is determined by what is in Recruitment Plus at the time of a student's application. |
DW_CodeDesc |
varchar(32) |
32 |
|
MD_DStu_EnteringCohortYear |
Year in which student first enrolled and was assigned a cohort. Not all students are assigned a cohort. Students without an assigned cohort get assigned a nonsense year of 9999 here, in lieu of NULL or an empty string. See also the FinancialAidPerson dimension, where the cohort typically exists, but is provisional for many people (who have applied for/been offered aid, but have not matriculated). |
DW_YearAtCarleton |
int |
|
|
MD_DStu_GraduatingCohortYear |
Actual, or projected, graduating class year for a given student. The procedure for assigning a value to this field is complex, and if a student makes slower or faster than anticipated academic progress, their class year may change periodically. |
DW_YearAtCarleton |
int |
|
|
MD_DStu_CommencementDate |
This field should not be exposed to users except via calculations. It is used, in particular, to calculate IPEDS graduation years, so the actual day of the month is not needed. We basically only use the year. |
<None> |
datetime |
|
|
MD_DStu_CohortCode |
The cohort code is used for grouping students when creating various key (e.g., IPEDS) reports. It is set once for a student, and then forever kept the same. Hence we have implemented it here as a type-I SCD attribute. |
DW_CodeDescShort |
varchar(16) |
16 |
|
MD_DStu_CatalogYear |
Used for degree audit; tells us what set of requirements applies to a given student. If unknown, defaults to 9999. |
DW_YearAtCarleton |
int |
|
|
MD_DStu_EntryTerm |
Term of initial entry to Carleton (e.g., "09/FA"). Defaults to 'Unknown' if the value here is unknown, not applicable, etc. |
DW_TermShortName |
char(7) |
7 |
|
MD_DStu_TermsCompleted |
Count of terms completed as part of degree program. This attribute is often used as a divisor. E.g., count of terms as varsity athlete / terms completed. |
<None> |
int |
|
|
MD_DStu_TermsInResidence |
Terms in residence, for graduates, is a KPI, because it tells us how much (potential) revenue can be gained from each student. Those who graduate with fewer terms in residence, that is, cost us tuition money. |
<None> |
int |
|
|
|
|