Table MD_DegreeStudent

Card of table MD_DegreeStudent

Name

MD_DegreeStudent

Dimensional Type

Dimension

Comment

Includes all students with an academic level code of UG, which means degree seekers (and since Carleton grants only bachelor's degrees, the UG implies people looking for a bachelor's degree). As with all 'people' dimensions, this dimension is heavy with type II SCDs, which means that it keeps history. Unfortunately, it only keeps history back to the point when the data warehouse was first deployed for degree students (late 2009). What this means, in practice, is that if you are looking at facts and fact tables with data from before late 2009, any rows that join to this dimension will join to the closest available row. Hence, as you go back in time from 2009, type-II SCDs in this dimension (like student enrollment status) will become less and less accurate. That is, they will reflect the student's status as of 2009 and not at the time that the fact in question was recorded.

Note that this dimension records not only current, but former, students. The quality and completeness of the data becomes higher as we move forward in time. Likewise, its quality drops and it becomes less complete as we move backwards in time.

If quality and consistency are potential issues (which they almost always are), it doesn't hurt to filter out students not relevant for the particular analytical or business problem you are trying to solve. E.g., if you are looking at time-to-graduate trends, you may want to use only students whose entering cohort year is 2000 or higher - or 1993 or higher. You can certainly use all students. But just be aware that your figures may be more reliable for more recent classes.

Caution: This is a dimension table, not an isolated spreadsheet or student-data repository. if you try to use this dimension as an isolated repository of student data (without joining it to fact tables), its peculiar structure will likely create problems, e.g., with double or otherwise multiply counted rows. If what you want is a big student spreadsheet, then check with the Colleague Team. They can either give you a data dump, or they can give you access to an appropriate ODBC or other view.

This table should be used with caution if linked to facts gathered before 2009 (when we started accumulating data). This is especially true for financial aid data. We have aid data going back for many years, and we can slice/dice it up in a variety of ways using the MD_FinancialAidPerson dimension - which will give us data about aid applicants for the year in which aid was granted. If, however, we split that same data up by attributes in this table, we will only be splitting the data up by the 'best fit' row in this table. So, e.g., if we are looking at aid data for 2005, the closest available rows in this table will be from 2009. Therefore, you will be using 2009 enrollment status, age, etc. degree student data to split up 2005 aid information. The solution is not to use this table to categorize data collected from before 2009. In the case of aid data, use MD_FinancialAidPerson instead.

There is a dummy 'Unknown' employee entry in this table, to make NULL foreign keys unnecessary, even in cases where a join would normally be impossible.

Note that this is FERPA-protected information. Handle accordingly.


Description of the table MD_DegreeStudent

Description!


Annotation of the table MD_DegreeStudent

Annotation!


Check constraint name of the table MD_DegreeStudent

CKT_MD_DEGREESTUDENT


List of incoming references of the table MD_DegreeStudent

Name

Child Table

Foreign Key Columns

AssetValueToDegreeStudent

MF_IssueTrackingAssetValue

MD_CollPers_SurrogateKeyDStu

DStuEnrolledForRegularTerm

MF_SectionEnrollmentByTerm

MD_CollPers_SurrogateKey

DStuReceivedGradeForRegularTerm

MF_SectionGradeByTerm

MD_CollPers_SurrogateKey

Relationship_17

MF_AwardAmount

MD_CollPers_SurrogateKey

Relationship_35

MF_PerYearAidFacts

MD_CollPers_SurrogateKey

Relationship_37

MF_ReaderRating

MD_CollPers_SurrogateKey

Relationship_83

MF_StudentCumulativeStats

MD_CollPers_SurrogateKey

Relationship_85

MF_SurveyCellData

MD_CollPers_SurrogateKeyDStu

Relationship_88

MF_SurveyAdminFact

MD_CollPers_SurrogateKeyDStu

Relationship_91

MF_MajorCount

MD_CollPers_SurrogateKey

Relationship_122

MF_IssueTrackingTicket

MD_CollPers_SurrogateKeyDStu

Relationship_145

MF_RoomAssignment

MD_CollPers_SurrogateKeyDStu

Relationship_170

MF_SectionEnrollment

MD_CollPers_SurrogateKey

Relationship_171

MF_AlumEventParticipation

MD_CollPers_SurrogateKeyDStu

Relationship_171

MF_SectionGrade

MD_CollPers_SurrogateKey

Relationship_173

MF_AlumActivityParticipation

MD_CollPers_SurrogateKeyDStu

Relationship_180

MF_AlumContact

MD_CollPers_SurrogateKeyDStu

Relationship_235

MF_CardTransaction

MD_CollPers_SurrogateKeyEmp


List of referencing views of the table MD_DegreeStudent

Name

Code

VMD_DegreeStudent

VMD_DEGREESTUDENT

VMD_StuClassYearLevel

VMD_STUCLASSYEARLEVEL

VW_DegreeStudentGeoInformation

VW_DEGREESTUDENTGEOINFORMATION

VW_GradesByInstructorWithDynamicSecurity

VW_GRADESBYINSTRUCTORWITHDYNAMICSECURITY


List of diagrams containing the table MD_DegreeStudent

Name

DimsPeopleDiagram


List of columns of the table MD_DegreeStudent

Name

Comment

Domain

Data Type

Length

MD_CollPers_SurrogateKey

 

<None>

numeric

 

MD_Pers_AlienStatus

As opposed to MD_Pers_IsNonResidentAlien, which simply classifies a person as a nonresident alien or not (and is normally more than sufficient for reporting), this field houses detailed alien status information.

As of 2011, this field is not populated in the data warehouse, and is not even implemented in the physical data warehouse tables. It will probably need to be added and populated at some point.

DW_CodeDesc

varchar(32)

32

MD_DStuRowStartYear

Used for fast retrieval of information; do NOT expose to users.

<None>

int

 

MD_ColPers_ColleagueID

Unique identifier in Colleague. Typically not exposed in reports. Used for internal joins and views within the data warehouse. If you think you need to print and analyze this attribute, think again - or check with the data warehouse team. Consider using Application ID instead.

DW_ColleagueID

char(7)

7

MD_Pers_EnterpriseUniqueID

Unique "person" ID assigned to everyone connected with Carleton. Maintained by the identity management system. Should be mandatory, but we cannot make it so until the IdM system is complete and fully functioning - and we can call out to it in our ETL code.

This attribute is currently not populated.

DW_EnterpriseUniqueID

char(8)

8

MD_Pers_CommonName

Full legal name, with middle name or initial. Does not include honorifics unless these are part of the legal name and are available in the system from which the data is derived. This field is mandatory and has no default.

Note that this field is 'calculated' for prospective students (i.e., put together out of the given name, middle name, and surname). Because the middle name is not well maintained for prospective students, the common name will be "dirty" as well. It is recommended that this field be avoided, therefore, for prospective students. Also, note: If you believe names are needed for prospective students, then consider also whether the report you are creating should be generated off of Recruitment Plus, and not the data warehouse.

Common names in Colleague should be very clean, and as a result common names in the data warehouse for students, faculty, and staff may be relied upon. If you need names for a report, however, consider reporting directly off of Colleague. Typically the data warehouse is best for aggregate analysis, and if you are using individual names, you may find other sources of information to be better.

Common names in Advance are also relatively clean. Hence also are common names in the warehouse for alumni and other constituencies maintained by External Relations.

DW_CommonName

nvarchar(64)

64

MD_Pers_Surname

Surname in the US means the same thing as "last name." Does not include suffixes like Jr., or honorifics.

This attribute is mandatory and has no default.

DW_Surname

nvarchar(32)

32

MD_Pers_GivenName

Given name in the US typically means the same thing as "first name." Default value is "missing".

DW_GivenName

nvarchar(32)

32

MD_Pers_MiddleName

Middle name or initial (if available). This field is fairly "dirty" and poorly maintained in some of our systems, especially in Recruitment Plus (not because our staff is doing a poor job, but because they sources from which they obtain their data are of uneven quality). This field should be avoided, therefore, if possible, particularly for prospective student data.

The default value is "missing".

DW_GivenName

nvarchar(32)

32

MD_Pers_Gender

Must be Male, Female, or Unknown.

DW_Gender

varchar(7)

7

MD_Pers_PrimaryAffiliation

Main relationship to Carleton. Values should be taken from the identity management system, if available. Defaults to 'Unknown'. Values are taken from a controlled vocabulary including: Student, Faculty, Staff, Alum, Parent, Friend.

Where populated, this attribute is an excellent measure of general affiliation. It is not, however, populated for all 'people' dimensions.

DW_CarlAffiliation

varchar(16)

16

MD_Pers_Citizenship

A country name, not code. It is not guaranteed that all values for this attribute will be consistent across all dimensions. For example, citizenship for a prospective student in MD_Prospect (populated from Recruitment Plus) may be "Czek Republic," while citizenship for that same person in MD_DegreeStudent may be "Czekoslovakia". It would be nice to have these values all be consistent across the campus, but doing so would constitute a project in and of itself. Defaults to 'Unknown'.

DW_Addr_Country

nvarchar(64)

64

MD_Pers_AgeRange

Age of person, discretized into a set of ranges useful for reporting and demographic classification. If age is unknown, the value of this attribute will be "Unknown."

Note that the age is calculated from the birth date, and does not take death into account. The age range, therefore, indicates the age if the person were alive (and in most cases they are).

DW_AgeRange

varchar(12)

12

MD_Pers_HomeAddrCity

City of home address. E.g., if my home address is 203 Woodley Street West, Northfield, MN 55057 USA, then my city is 'Northfield'. If my home address is unavailable, and all we have is a mailing address, we use the mailing address (in preference to nothing at all) because, presumably, I receive mail somewhere near where I live, or in a place I spend a lot of time.

Note that this field often contains extra information if the city is outside the US, so for non-US cities YMMV, i.e., you may not get the results you're looking for. For example, the postal code for foreign addresses may be coded after the city. The country may also be tacked on.

This field is therefore not suitable for serious analysis.

DW_Addr_City

nvarchar(32)

32

MD_Pers_HomeAddrNonUSProvince

 

DW_Addr_StateOrProvince

nvarchar(64)

64

MD_Pers_HomeAddrUSState

State of home address. E.g., if my home address is 203 Woodley Street West, Northfield, MN 55057 USA, then my state is 'Minnesota'. If I live outside the US, this attribute will be 'Not Applicable'. If no home address is available for me, the value will be 'Unknown'. If my home address is unavailable, and all we have is a mailing address, we use the mailing address (in preference to nothing at all) because, presumably, I receive mail somewhere near where I live, or in a place where I spend a lot of time.

Armed forces 'states' (really postal routing codes) as of 2012 still are not expanded. E.g., if someone's address is 'APO AA' (Army Post Office, Armed Forces Americas), the city will be 'Apo' and the state will be 'AA' as of 2012. In other words, military postal codes of AA, AE, AP will not be expanded to a longer name as of 2012. This leads to awkward 'state' lists in some reports, where you see, e.g., AA, AE, and AP alongside Pennsylvania, Illinois, Rhode Island, Minnesota, etc.

DW_Addr_USState

varchar(24)

24

MD_Pers_HomeAddr5DigitUSZip

5-digit US zipcode. Five digits (and not nine) because this attribute is intended for aggregate spatial analysis, i.e., for grouping people and things into geographical units. It's not meant to identify individual people or addresses, which is not something one typically does in a data warehouse. If you need details about where someone lives, exactly, use Microsoft Access to join data here to data in the appropriate transactional system (e.g., Recruitment Plus, Colleague, or Advance). Defaults to '00000'.

DW_Addr_5DigitUSZipcode

char(5)

5

MD_Pers_HomeAddrUSRegionFIPS

Contains the FIPS US region (e.g., "Middle Atlantic" or "New England"). Defaults to "Unknown." Used for various reporting. Compare the USRegionCarleton, which is also used for institutional reporting. Everyone is encouraged to use one of these two systems, if at all possible - for consistency's sake.

FIPS codes, in general, are a set of standardized numeric or alphabetic codes issued by the US National Institute of Standards and Technology (NIST) to ensure uniform identification of geographic entities throughout all federal government agencies.

DW_USRegion

varchar(64)

64

MD_Pers_HomeAddrUSRegionCarleton

Contains the US region (e.g., "Middle States" or "New England"). Defaults to "Unknown." Used for various Carleton reports. Compare the USRegionFIPS, which is also used for various reports. Everyone is encouraged to use one of these two systems, if at all possible - for consistency's sake.

DW_USRegion

varchar(64)

64

MD_Pers_HomeAddrCountry

DO NOT USE THIS FIELD FOR SERIOUS REPORTING. The data is entered for operational/addressing purposes, and is not of sufficiently high quality to work for analysis.

In Colleague, at least as of 2011, international addresses are not handled well, and therefore the country may not be well or correctly populated. Colleague, for example (again, as of 2011), has no place for a foreign postal code, so often people place that, along with the country, in the space that's supposed to house the city. This makes things print correctly, but it means the country will not be populated. And since the data warehouse defaults to 'USA' if this field is blank (which is generally appropriate, across most systems), this means that many international addresses end up in the warehouse with a country of the United States.

In general, therefore, it is wise to use MD_Pers_Citizenship instead of MD_Pers_HomeAddrCountry, although the challenge there is that citizenship is designated in different ways across systems (not necessarily with consistent ISO country names or codes).

Note that country data for prospective students, especially ones who are only 'suspects' (early in the admissions cycle - not applicants or admits) may not be wholly reliable, because we obtain their addresses from third parties who don't consistently mark the country.

DW_Addr_Country

nvarchar(64)

64

MD_Pers_HomeAddrLatitude

Latitude of home address, as a floating point number. Best populated for US addresses, and populated in a very coarse fashion for those (using the zipcode) before late summer 2011.

Warning: Note that there is no default, and the value here may be NULL.

Values for this attribute from before late summer 2011 are not truly fit for user consumption, except by special request, and should not (yet) be exposed via views.

Note that the longitude and latitude for a given address may change over time, without any change in the address, because we occasionally change geocoding providers. The geocoding providers are also continually improving their algorithms. If a longitude or latitude changes, without any address change, a new row is generated in the data warehouse. But history is not rewritten. The earlier, presumably less accurate figure remains in non-current rows.


<None>

float

 

MD_Pers_HomeAddrLongitude

Longitude of home address, as a floating point number (negative = west longitude). Best populated for US addresses, and populated in a very coarse fashion for those (using the zipcode) before late summer 2011.

Warning: Note that there is no default, and the value here may be NULL.

Values for this attribute from before late summer 2011 are not truly fit for user consumption, except by special request, and should not (yet) be exposed via views.

Note that the longitude and latitude for a given address may change over time, without any change in the address, because we occasionally change geocoding providers. The geocoding providers are also continually improving their algorithms. If a longitude or latitude changes, without any address change, a new row is generated in the data warehouse. But history is not rewritten. The earlier, presumably less accurate figure remains in non-current rows.

<None>

float

 

MD_Pers_HomeAddrMilesFromCampus

Distance in miles from Carleton main campus. Populated using the home address latitude and longitude, and only for US addresses with a zipcode. The value here has poor resolution and should not be used except for very general groupings (nothing with a resolution less than twenty miles). Eventually we may license a geocoding service, and populate this attribute well. Until then, YMMV.

This attribute is not 'fit' for user consumption, except by special request, and should not (yet) be exposed via views.

Warning: Note that there is no default, and the value here may be NULL.

<None>

float

 

MD_Pers_IsNonResidentAlien

"Yes" if the person in question is not a citizen or national of the United States and who is in the United States on a visa or temporary basis and does not have the right to remain indefinitely. "No" otherwise.

IPEDS does not require us to report the ethnicity/race of nonresident aliens, so (although we may collect race/ethnicity and alien status independently) when reporting to IPEDS, "Non-Resident Alien" becomes like a race, i.e., you're white, black, etc., or a nonresident alien. To do IPEDS reporting, most people-based views in the data warehouse have an IPEDS ethnicity attribute that includes nonresident alien as a race. Use that attribute for IPEDS reporting.

Use this attribute if you simply want to divide a population up into two piles, one for nonresident alieans and another for other people.

DW_YesNoOrUnknown

varchar(7)

7

MD_Pers_IsOfUnknownRaceEthnicity

"Yes" if neither the race nor ethnicity is known. "No" otherwise. Calculated from Race and IsHispanicOrLatino, as well as from OLDPrimaryRaceList. If the old race list is empty or 'Unknown' and if Race is 'Unknown', and IsHispanicOrLatino = 'Unknown' or 'No', then this attribute will be 'No'. It basically means we don't know enough to classify someone by ethnicity or race, in either the old or the new 2010 census system.

DW_YesOrNoNOTNULL

char(3)

3

MD_Pers_IsHispanicOrLatino

Census requires that we ask first if a person is Hispanic, Latino, or of Spanish origin. This is the 'ethnicity' question. We may then ask a 'race' question. If the person in question lists Hispanic ethicity (or in the 'old' pre-2010 system) lists Hispanic as their 'race' then this attribute will be 'Yes'. If not, then 'no'. If no information is available, then 'Unknown'.

Note that some systems (e.g., Colleague) store only one race/ethnicity. Recall also that before 2010 Hispanic was considered a race. Hence if someone considers themselves primarily African American, but secondarily Hispanic, the Hispanic ethnicity will not be stored in Colleague (at least before the 2010 census standards kicked in). And as a result, this attribute will, incorrectly, be registered as 'No' in some cases.

DW_YesNoOrUnknown

varchar(7)

7

MD_Pers_Race

Holds the person's race. This attribute can be treated as "primary race" although the way this information is collected makes the two not entirely equivalent, since in some cases Carleton may choose to collect fuller information on people of multiple races.

This attribute (as opposed to MD_Pers_OldPrimaryRace) does not tell us whether a person is of Hispanic, Latino, or Spanish origin.

Note that in the new ethnicity/race classification system (2010 census and beyond), multi-racial people cannot indicate what races they belong to. They are simply multi-racial. Also, Hispanic is not considered a race, as noted. It's an ethnicity.

Hence this attribute will always have a single value, one of:

African American
American Indian-Alaska Native
Asian
Hawaiian-Pacific Islander
Two or More Races
Unknown
White

This attribute is not populated before 2009, and is unevently populated until 2011 (as of 2011, employees have not been fully resurveyed).

Note that some systems store only one race/ethnicity. Others do not. The data here will therefore be of uneven completeness and consistency. When reporting, stick with a single system of record, or a single constituency (like 'prospective students') and you'll be fine. Otherwise, be extremely careful, and be sure to work with people who administer the source systems who can advise you.

General documentation on how race is now stored (2010) in major systems: http://apps.carleton.edu/campus/ira/fed_race_ethnic/

DW_RaceForCensus2010

varchar(64)

64

MD_Pers_RaceList

Comma-separated list of all races a person claims to belong to, ordered (by preference? alphabetically? by an obscure "trumping" system we concoct?).

Use this attribute as a filter, to select out people by racial affiliation (e.g., WHERE MD_Pers_OLDRaceList LIKE '%African%').

Note that this attribute is populated using information that the US government doesn't require us to collect. According to 2010 census standards, a multi-racial person is just multi-racial (i.e., "multi-racial" is their race). No further information is collected. Also, Hispanic is not a race.

This attribute is currently not populated, and will not be until Carleton implements the 2010 census ethnicity/race classification system across its enterprise systems.

DW_CommaDelimitedList

varchar(128)

128

MD_Pers_OLDPrimaryRace

Until the census-2010 race/ethnicity system is fully implemented, this is our main "race" attribute. It should eventually become, hence the designation "old." It will still be populated, however, even after we have converted to the new census 2010 system, so that we can report easily across multiple years, both before and after the transition. And it is the only attribute you can reliably use to report on ethnicity data, longitudinally, before and after the transition (which began in late 2009).

Possible values of this attribute include:

Asian
Black or African American
Hispanic
Mixed race or other
Native American-Alaskan Native
Unknown
White

The primary race is not necessarily selected by the person reflected in the data. For students it is inferred using a 'trumping' system that's complex and that most folks don't understand fully (e.g., if you're white and African American, the latter trumps the former and becomes your primary ethnicity). In this old system, 'Hispanic' was considered a race. Or, conversely, everything was an ethnicity. In the new 2010 census system, 'Hispanic' is an ethnicity, and most everything else ('Asian', 'White', etc.) is a race.

When reporting, stick with a single system of record, or a single constituency (like prospective students or degree students) and you'll be fine. Otherwise, be extremely careful, and be sure to work with people who administer the source systems who can advise you.

Note also that, as per institutional standards, ethnicities like 'Unknown' are not automatically counted as white, as sometimes happens in reports generated, e.g., by Student Financial Services and other offices.

DW_CodeDescLong

varchar(64)

64

MD_Pers_OLDRaceList

Comma-separated list of races a person claims to belong to, e.g., "Asian, White". Follows the order of races provided by the source system, which may or may not be ordered. Defaults to the empty string. Note that this attribute will still be populated, even after we have converted to the new census-2010 race/ethnicity system, so that we can report easily across multiple years, both before and after the transition.

Use this attribute as a filter, to select out people by racial affiliation (e.g., WHERE MD_Pers_OLDRaceList LIKE '%African%').

This attribute uses the "old" coding standards where the primary race is determined by a "trumping" system we utilize (it's not based entirely on what the person tells us). It also treats Hispanic as a race, which the 2010 census standards have changed (Hispanic is now an ethnicity; a Hispanic person can be of any race).

DW_CommaDelimitedList

varchar(128)

128

MD_Pers_Disability1

First of max 3 possible disabilities we track. Used for EOE purposes. Feds will be asking us to track this for students. Possibly needed for alumni and admissions statistics/reporting as well.

This attribute is currently, as of 2009, not populated.

DW_Disability

varchar(32)

32

MD_Pers_Disability2

 

DW_Disability

varchar(32)

32

MD_Pers_Disability3

 

DW_Disability

varchar(32)

32

MD_Pers_DisabilityList

Comma-separated list of all disabilities. Feds will be asking us to track this for staff and students.

This attribute is currently, as of 2009, not populated.

DW_CommaDelimitedList

varchar(128)

128

MD_Pers_IsVeteran

"Yes" if a person is a veteran by federal government reporting standards, particularly EOE, for employees. "No" otherwise. Although the database also allows "Unknown" here, this value is not used.

Veteran status is available in Recruitment Plus for prospective students, although not well maintained. Veteran status is maintained in Colleague, for students, but we rarely have anything to put in this field, since so few of our students are veterans (e.g., eight as of March 2009).

DW_YesNoOrUnknown

varchar(7)

7

MD_Stu_AcadLevel

Colleague academic level names, Undergraduate (UG), Summer Graduate (SG), and Non-Degree (ND). We use the description here, not the 'code,' for human readability. If a student has graduated, then the value here will be 'Not applicable.' Ditto for students who have received aid offers, etc., but who have not matriculated.

DW_AcademicLevel

varchar(30)

30

MD_DStu_RecruitmentPlusPKID

Recruitment Plus PKID used for joins to prospect/Admissions tables. This attribute may be NULL and therefore should not be exposed to most users. Do not use this attribute in typical reports or queries. This attribute is an integer (as in Recruitment Plus).

DW_RecruitmentPlusPKID

int

 

MD_DStu_AdvanceID

Advance ID used for joins to alumni tables. This attribute may be NULL and therefore should not be exposed to most users. Do not use this attribute in typical reports or queries. This attribute is a ten-character, zero-left-padded string of digits, e.g., 0000012481.

DW_AdvanceID

char(10)

10

MD_DStu_IsFirstTimeCollege

"Yes" if student is a first-time student. Some reports require us to select out only first-time full-time students who started as freshmen.

This is a so-called type-1 attribute, meaning that once it's set, it's set for all times for a given student. Some attributes only get set 'as of' a certain date, and change as history moves on, like a zipcode or a phone number, or a student status. This attribute, however, reflects an intrinsic property of a person (or their state upon entry to Carleton), and applies universally, much like a gender (well, usually) or an ethnicity.

Right now we 'calculate' the value of this attribute by taking the inverse of the EnteredAsTransfer attribute. If somebody entered as a transfer, they get a "No" here. Otherwise we mark them as a "Yes," i.e., as a first-time college student. Compare this, e.g., with the HERI/CIRP rules: http://www.gseis.ucla.edu/heri/researchers/FTFT.PDF.

DW_YesOrNoNOTNULL

char(3)

3

MD_DStu_EnteredAsTransfer

"Yes" if a given student entered as a transfer student. "No" otherwise. This attribute is a type 1 SCD, which means that even though it indicates how a student entered Carleton, it is applied to all RowStart/RowEnd times. In other words, once a "yes," always a "yes" here, unless the student's program changes fundamentally (e.g., leaves and is readmitted).

Note that this field doesn't really, truly mark someone specifically as entering as a transfer. At Carleton, you either are or aren't a transfer, and if you are, you (by implication) entered as one.

DW_YesOrNoNOTNULL

char(3)

3

MD_DStu_EnteredAsFullTime

"Yes" if a given student entered as a full-time student. "No" otherwise.

This value is calculated as of the entry term, whether or not a given student completed that term, or was even registered as of census date. Since Carleton has no part-time status (we use just F, L, O, R, X, etc.), we simply mark anyone as registered for less than twelve credits in their first term as entering part time. This is quite rare, if it ever happens, at Carleton.

This attribute is to be distinguished from the EnrollmentStatus, which is a current value - i.e., one that refers to the current term.

DW_YesNoOrUnknown

varchar(7)

7

MD_DStu_LegacyType

This attribute indicates whether a student is a first-generation college student or a legacy of some type. Only a single value is stored here (so a person can't be a first-generation college student, and a parent-legacy). What gets stored here is determined by what is in Recruitment Plus at the time of a student's application.

Values are available for this attribute in Colleague only from Nov 2008 onward, and the data that is there (as of 2010) appears not to be fully consistent with Recruitment Plus. Hence be careful when using this attribute in longitudinal analysis.

Here we treat this attribute as a type-1 attribute, meaning that once it's set, it's set for all time for a given student. The reason for this is that the legacy status is currently only set for applicants, and the data comes (as noted) from Recruitment Plus once, and is then no longer updated (theoretically). So there's no point in making this a type-II SCD. Instead, we treat it as an intrinsic attribute of the student, like gender or ethnicity.

You can think of this attribute as 'legacy status in Recruitment Plus at the time of the student's application'.

DW_CodeDesc

varchar(32)

32

MD_DStu_EnteringCohortYear

Year in which student first enrolled and was assigned a cohort. Not all students are assigned a cohort. Students without an assigned cohort get assigned a nonsense year of 9999 here, in lieu of NULL or an empty string. See also the FinancialAidPerson dimension, where the cohort typically exists, but is provisional for many people (who have applied for/been offered aid, but have not matriculated).

There are 2 pieces of information that can be gleaned from the 'cohort' field in Colleague (from which this data is taken). The first is the entering year (08FF indicates a 2008 start of the undergraduate program). The second is the full-time/transfer information (cohort group ends in FF if first time student, FT if this is considered a transfer student). We break the year out here. The fullt-ime/transfer status is broken out elsewhere.

Note that this attribute is particularly useful in grouping students by what IPEDS calls the full-year cohort, which IPEDS defines as the group of students entering at any time during a particular 12-month period September 1 through August 31 that is established for tracking and reporting Graduation Rate (GRS) data for institutions that primarily offer occupational programs of varying lengths. Students must be full-time and first-time to be considered in the cohort (which we determine from other attributes).

A lot of IPEDS stats, however, focus on the fall cohort, i.e., the group of students who all entered as first-time full-time freshmen in a particular term. For that, see MD_DStu_EntryTerm.

DW_YearAtCarleton

int

 

MD_DStu_GraduatingCohortYear

Actual, or projected, graduating class year for a given student. The procedure for assigning a value to this field is complex, and if a student makes slower or faster than anticipated academic progress, their class year may change periodically.

This attribute is similar to, but not quite the same, as the pref_class_year in Advance (which denotes the class an alum prefers to be associated with, and can be virtually anything; for alumni relations purposes we just don't care what class year a person says they want to be associated with).

Students without a graduation year get assigned a nonsense year of 9999 here, in lieu of NULL or an empty string.

Note that having a graduating cohort year does not mean that a given student has actually graduated. The value here is just the group they should (have) graduate(d) as part of. Also, note that lacking a graduating cohort year may not indicate that a student hasn't graduated. For example, for about a decade (up until 2011) Carleton routinely removed the class year when a student was reported to have died. This practice ceased during the 2010-11 academic year, but the data cleanup may not have been complete in its aftermath, and there is always the possibility of other such practices being introduced in the future. There are also a number of cases where a student receives a graduating cohort year that does not reflect the actual year in which they graduated. For example, a student that entered in the fall of 2003 and graduated in 2010 may in some cases keep their original anticipated class year of 2007.

Do not use this field to limit results to graduates. To be sure to get only graduates in a given query, filter on the MD_DStu_EnrollmentStatus field. Select only students who have graduated. As long as this field is being proprly mainained (and not, e.g., converted to a different code if a student dies), this will limit results, correctly, to graduates.

DW_YearAtCarleton

int

 

MD_DStu_CommencementDate

This field should not be exposed to users except via calculations. It is used, in particular, to calculate IPEDS graduation years, so the actual day of the month is not needed. We basically only use the year.

The graduation and commencement dates (according to current, 2011, business practice) should be the same at the month level (the two may differ as to the exact day, but really shouldn't). So anyway the designation 'commencement' here is somewhat arbitrary. It could just as well have been called a graduation date, except that this would have led to potential confusion with the graduating cohort.

<None>

datetime

 

MD_DStu_CohortCode

The cohort code is used for grouping students when creating various key (e.g., IPEDS) reports. It is set once for a student, and then forever kept the same. Hence we have implemented it here as a type-I SCD attribute.

The values expressed here are taken directly from XSTU.ENTERING.COHORT in Colleague, which is stored in the Federal Cohort Group field on the "UG" student academic level record. The data entry field in Colleague is "Federal Cohort" on the "STAL - Student Academic Level" screen.

The format of the cohort code is: two digit year, single letter for term (F=fall, W=winter, S=spring), single letter for first-year student (F) or transfer (T), for example, 09FF would mean that a given student entered in the fall of 2009 as a full-time student.

The cohort codes go back to the fall of 1993, and they are very carefully maintained, because they are used in various key (e.g., IPEDS) reports. Some of these reports require looking at a specific group of students who entered at a specific point in time, but with some changes and exemptions allowed based on later events, such as death, military service, and missionary work. To factor in later events such as this, but otherwise utilize a point-in-time snapshot requires a mixture of type-I and II SCDs. The cohort code is our type-I SCD. When we update it, e.g., due to a death, it has a retroactive effect.

DW_CodeDescShort

varchar(16)

16

MD_DStu_CatalogYear

Used for degree audit; tells us what set of requirements applies to a given student. If unknown, defaults to 9999.

This attribute is a type-I SCD, because it applies retroactively to all courses a given student took/takes, when determining what requirements they have met. There is generally no reason to 'keep history' on the value entered here.

DW_YearAtCarleton

int

 

MD_DStu_EntryTerm

Term of initial entry to Carleton (e.g., "09/FA"). Defaults to 'Unknown' if the value here is unknown, not applicable, etc.

A lot of IPEDS statistics require grouping students by entry term and by excluding those that didn't start in the fall, and this attribute is very useful in performing such groupings/exclusions.

Other stats require a so-called full-year cohort. For that, use MD_DStu_EnteringCohortYear.

DW_TermShortName

char(7)

7

MD_DStu_TermsCompleted

Count of terms completed as part of degree program. This attribute is often used as a divisor. E.g., count of terms as varsity athlete / terms completed.

When we count here we refactor double or triple terms (e.g., FW, AY, etc.) as multiple terms, not just as one term.

Because the counting method is volatile (we're still not absolutely sure how to do this), this field is not exposed in views like VMD_DegreeStudent.

This number is highly unreliable before the 'converstion' to Colleague in 1992, and is still, as of 2010, not particularly good, in the sense that we expect our understanding of what the Registrar wants here to be refined over time.

<None>

int

 

MD_DStu_TermsInResidence

Terms in residence, for graduates, is a KPI, because it tells us how much (potential) revenue can be gained from each student. Those who graduate with fewer terms in residence, that is, cost us tuition money.

When we count here we refactor double or triple terms (e.g., FW, AY, etc.) as multiple terms, not just as one term.

Because the counting method is volatile (we're still not absolutely sure how to do this), this field is not exposed in views like VMD_DegreeStudent.

This number is highly unreliable before the 'converstion' to Colleague in 1992, and is still, as of 2010, not particularly good, in the sense that we expect our understanding of what the Registrar wants here to be refined over time.

<None>

int

 


2  3