Introduction and explanation of SPSS's own case data file

The introduction and explanation of case data files brought by SPSS beginners have great demand for case data files. In fact, during the installation of SPSS software package, these files have been automatically placed on your computer hard disk. So how to find it? I mentioned earlier "where to download the case data of SPSS". Students who need it can find or download it themselves. Today I will share the description of the case data file of SPSS. See below for details:

Accident. sav

The hypothetical data file involves an insurance company, which is studying the age and gender risk factors of automobile accidents in a given area. Each case corresponds to a cross-classification of age category and gender category.

adl.sav

This hypothetical data file involves the advantages of determining the recommended treatment type for stroke patients. Doctors randomly divided female stroke patients into two groups. The first group received standard physical therapy, while the second group received additional emotional therapy. During the three-month treatment period, each patient's ability to perform general daily activities will be scored as the original variable.

advert.sav

This hypothetical data file involves the actions taken by retailers when checking the relationship between advertising expenditure and sales performance. To this end, they collected past sales figures and related advertising expenses.

Aflatoxin. sav

This hypothetical data file involves the detection of aflatoxin in grains, and the concentration of aflatoxin will vary greatly due to different grain yields (between different grains and between the same kind of grains). The grain processing plant obtained 16 samples from each of the eight grain yields, and measured the level of aflatoxin in parts per billion (PPB).

anorectic

When studying the standard symptom reference of anorexia/bulimia, the researcher 1 investigated 55 teenagers who were known to have eating disorders. Each patient will be examined four times a year, so the total number of observations is 220. During each observation period, these patients will be graded item by item according to the symptoms of 16. However, the symptom scores of patients No.765438 and No.76 were missing at time point 2, and the symptom score of patient No.47 was missing at time point 3, so the number of effective observations was 2 17.

bankloan.sav

This hypothetical data file involves a bank's measures to reduce the loan default rate. This file contains financial and demographic information of 850 past and potential customers. The first 700 cases are customers who have obtained loans before. The remaining 150 cases are potential customers, and banks need to classify them according to the level of credit risk.

bankloan_binning.sav

This hypothetical data file contains financial and demographic information of 5,000 past customers.

behavior.sav

In a classic example, 252 students were asked to evaluate the combination of 15 situations and 15 behaviors on a scale of 10, ranging from 0 = "very appropriate" to 9 = "very inappropriate". The average value is above personal value, and values are regarded as different.

behavior_ini.sav

This data file contains the initial configuration of the behavior.sav 2D solution.

brakes.sav

This hypothetical data file is related to the quality control of the factory that produces high-performance automobile disc brakes. This data file contains the diameter measurements of 16 disc brakes for each of the eight special machine tools. The target diameter of disc brake is 322 mm.

Breakfast. sav

In a classic study, 32/kloc-0 MBA students from Wharton Business School and their spouses were asked to rate 15 breakfast foods in order of preference, from 1 = and their preferences were recorded according to six different situations, from "all like" to "fast food with drinks only".

Breakfast-generally speaking. save

The data file only contains the first case of breakfast food preference, that is, "like it all".

Broadband _ 1.sav

This hypothetical data file contains the number of customers who have subscribed to the national broadband service in each region. The data file contains the monthly number of users in 85 regions during the four-year period.

Broadband _2.sav

This data file is the same as broadband_ 1.sav, but contains data for another three months.

Automobile _ insurance _ claim. sav

Four data sets on automobile damage compensation that have been put forward and analyzed elsewhere. The average claim amount can be modeled as a gamma distribution, and the average value of the dependent variable is related to the linear combination of the insured's age, vehicle type and vehicle age by using the inverse correlation function. The number of claims can be used as a measure.

car_sales.sav

This data file contains hypothetical sales estimates, pricing and physical specifications of various brands and models of vehicles. Prices and physical specifications can be obtained from edmunds.com and manufacturers.

car _ sales _ up prepared . sav

This is a modified version of car_sales.sav and does not contain any translated version of this field.

Carpet. sav

In a common example 5, a company attaches great importance to the marketing of a new carpet cleaning product, hoping to test the influence of the following five factors on consumer preferences: packaging design, brand name, price, excellent household goods logo and return guarantee. There are three factor levels in packaging design, and each factor level is different because of the different position of brush body; There are three brand names (K2R, Glory, Bisell); There are three price levels; The last two factors each have two levels (yes or no). Ten consumers ranked 22 characteristics defined by these factors. This variable preferably contains the rank of the average level of each profile. Low grades correspond to high preferences. This variable reflects the overall measure of preference for each profile.

carpet_prefs.sav

This data file is based on the same example described in carpet.sav, but it also contains the actual ranking order collected from 10 consumers. Consumers were asked to rank 22 product profiles from favorite to least favorite. Carpet_plan.sav defines that variables PREF 1 to PREF22 contain identifiers of related features.

Directory. sav

This data file contains hypothetical monthly sales data of three products sold by a cataloging company. It also includes data of five possible predictive variables.

catalog_seasfac.sav

This data file is the same as catalog.sav, except that a set of season factors calculated in the process of "season decomposition" and additional date variables are added.

Cellular telephone. sav

This hypothetical data file is related to a mobile phone company's measures to reduce customer churn. Customer churn scores are applied to accounts, and the scores range from 0 to 100. Accounts with a score of 50 or higher may change providers.

Ceramics.sav

This hypothetical data file involves the actions taken by the manufacturer to determine whether the new high-quality alloy has higher heat resistance than the standard alloy. Each case represents an individual test of an alloy; The heat resistance limit of the alloy will be recorded in the shell.

Cereal. sav

The hypothetical data file involves a poll of 880 people about breakfast preferences, which records the participants' age, gender, marital status and active lifestyle (according to whether they exercise at least twice a week). Each case represents an individual responder.

clothing_defects.sav

This is a hypothetical data file about the quality control process of garment factory. Inspectors should sample and test the clothes produced in large quantities in factories every time, and count the number of unqualified clothes.

Coffee. sav

This is a data file about the cognitive brand image 6 of six kinds of iced coffee. For each of the 23 characteristic attributes of iced coffee, people will choose all the brands described by this attribute. For the sake of confidentiality, the six brands are represented by AA, BB, CC, DD, EE and FF respectively.

Contact. sav

The hypothetical data file includes a list of contact information of a group of company computer sales representatives. Each contact method is classified according to the company department to which these sales representatives belong and the level of their company. At the same time, it also recorded the latest sales volume, the time since the last sale, and the size of the company contacted.

creditpromo.sav

This hypothetical data file involves the measures taken by a department store to evaluate the effect of the latest credit card promotion. To this end, 500 cardholders were randomly selected. Half of them have received advertisements to reduce consumer interest rates in the next three months. The other half received standard seasonal advertisements.

Customer database

This hypothetical data file involves a company's use of information in the data warehouse to provide preferential products to customers who are most likely to respond. Randomly select a subset of the customer base, offer them special offers, and record their reactions.

Customer information. sav

This hypothetical data file contains customer email information, such as name and address.

Customer _ subset. sav

A subset of 80 cases in customer_dbase.sav

Debate. sav

It is assumed that the data file includes paired answers to surveys conducted by participants in political debates before and after the debate. Each case corresponds to a separate responder.

Debate _ aggregation. sav

This hypothetical data file summarizes the answers in the debate. sav. Each case corresponds to the cross-classification of preferences before and after the debate.

Demonstration. sav

This is a hypothetical data file about the shopping customer database, which is used to send monthly goods. Will record the customer's reaction to the goods and various demographic information.

demo_cs_ 1.sav

This hypothetical data file involves the first step of the company in compiling the survey information database. Each case corresponds to a different city and records the regional, provincial, district and city identification.

Demonstration _cs_2.sav

This hypothetical data file involves the second step of the company in compiling the survey information database. Each case corresponds to a different family unit in the city selected in the first step, and the identification of the region, province, district, city, street and unit is recorded. It also includes the sampling information of the first two design stages.

demo_cs.sav

Assume that the data file contains survey information collected by complex sampling design. Each case corresponds to a different family unit and records various demographic and sampling information.

dmdata.sav

The hypothetical data file contains demographic and purchasing information of direct selling companies. Dmdata2.sav contains the information of a subset of contacts who received the trial email, and dmdata3.sav contains the information of the remaining contacts who did not receive the trial email.

dietstudy.sav

This hypothetical data file contains the research results of "steelman Diet" 7. Each case corresponds to a single subject, and the weight (pounds) and triglyceride level (mg/100 ml) before and after the implementation of the diet plan were recorded.

dvdplayer.sav

This is a hypothetical data file about developing a new DVD player. Marketing teams use prototypes to collect focus group data. Each case corresponds to one surveyed user, recording their demographic information and their answers to prototype questions.

German _credit.sav

The data file is taken from the "German Credit" data set in the repository of the machine learning database 8 of the University of California, Irvine.

Grocery store _ 1month.sav

The hypothetical data file is based on the data file "groceries _ coupons. sav" and weekly shopping "accumulation" is added, so each case corresponds to a separate customer. So some variables that changed every week disappeared, and now the recorded consumption amount is the sum of the consumption amount during the four-week study period.

Grocery store _ coupons.sav

The hypothetical data file contains survey data collected by grocery chain stores that value customers' shopping habits. Investigate every customer around you, each case corresponds to a separate customer week, and record information such as the place and way of shopping (including the amount of money customers spent on groceries in that week).

guttman.sav

Bell 9 created a table to illustrate possible social groups. Guttman 10 refers to a part of the table, which includes five variables. It is used to describe the following seven theoretical social groups: audience (such as people in a football match), audience (such as people in a theater or attending a class lecture), public (such as newspaper or TV audience), organization group (similar to audience but closely related), primary group (closely related), secondary group (spontaneous organization) and modern community.

health_funding.sav

This hypothetical data file contains data about the health care fund (amount per1000 people), the incidence rate (ratio per 10000 people) and the attendance rate of health care providers (ratio per 10000 people). Each case represents a different city.

hivassay.sav

This hypothetical data file involves the initiative of a drug laboratory in developing a rapid analysis to detect HIV infection. The test result is 8 dark red shadows. If there are deeper shadows, it means that there is a great possibility of infection. 2000 blood samples were used for laboratory tests, half of which were infected with HIV and the other half were not infected.

hourlywagedata.sav

The hypothetical data file involves the hourly wages of nurses with different experience levels working in government agencies and hospitals.

insurance_claims.sav

The hypothetical data file involves an insurance company, which wants to build a model to mark suspicious and potentially fraudulent claims. Each case represents a separate claim.

insure.sav

Suppose the data file involves an insurance company, which is studying the risk factors that indicate whether customers will claim compensation under the life insurance contract of 10. Each case in the data file represents a contract matched by age and gender, one of which records the claim and the other does not.

judges.sav

The hypothetical data file involves the scores given by a trained judge (plus a gymnast) for 300 gymnastics performances. Each row represents a performance; The judges watched the same performance.

Kinship _dat.sav

Rosenberg and Kim 1 1 began to analyze 15 kinship items (aunt, brother, cousin, daughter, father, granddaughter, grandfather, grandmother, grandson, mother, nephew or nephew, niece or niece, sister, son and uncle). They asked four groups of college students (two groups of female students and two groups of male students) to rank the projects according to their similarity. They asked two groups of students (a group of female students and a group of male students) to rank twice, and the second ranking was different from the first ranking. In this way, a * * * gets six groups of "sources". Each source corresponds to an approximate matrix of 15 x 15, and the value in its cell is equal to the number of people in the source minus the number of times the objects in this source are divided.

Kinship _ini.sav

This data file contains the initial configuration of the three-dimensional solution of the kinship _ dat.sav.

Kinship _var.sav

The data file contains independent variables such as gender, generation and (separated) degree, which can be used to explain the dimension of the solution of kinship _ dat.sav. Specifically, they can be used to limit the solution space to a linear combination of these variables.

Market value. sav

This data file covers the housing sales of the new housing development project in Algonquin, Illinois. During1999–2000. These sales only come from public records.

nhis2000_subset.sav

The American Health Interview Survey (NHIS) is a large-scale population survey aimed at all American citizens. The survey conducted face-to-face interviews with representative family samples in the United States, and obtained demographic information and observation data on the health behavior and health status of each family member. This data file contains a subset of information from the 2000 survey. National Center for Health Statistics. American health interview survey in 2000. Common data files and documents. FTP://ftp.cdc.gov/pub/health _ statistics/nchs/datasets/NHIS/2000/.Published in 2003.

Ozone.sav

These data contain 330 observations of six meteorological variables, which are used to predict ozone concentration according to the remaining variables. In previous researchers, 12 and 13 found the nonlinearity between these variables, which hindered the standard regression method.

Pain _ medicine. sav

This hypothetical data file contains the results of clinical trials of anti-inflammatory drugs used to treat chronic arthritis pain. We are interested in the time when the drug takes effect and the comparison with the existing drugs.

patient_los.sav

This hypothetical data file contains the treatment records of patients with suspected myocardial infarction (MI or "heart attack") diagnosed by the hospital. Each case corresponds to a patient, and some variables related to his hospitalization period are recorded.

patlos_sample.sav

This hypothetical data file contains sample treatment records of patients who received thrombolytic agents during the treatment of myocardial infarction (that is, MI or "heart disease"). Each case corresponds to a patient, and some variables related to his hospitalization period are recorded.

poll_cs.sav

This hypothetical data file involves the actions of polling agencies to determine the level of public support for the bill before formal legislation. This situation corresponds to registered voters. Each case records the counties, towns and districts where voters live.

poll_cs_sample.sav

This hypothetical data file contains a sample of voters listed in poll_cs.sav The samples are selected according to the design specified in poll.csplan, and the data file records contain probability and sample weight. Please note that there is also a file (poll _ jointprobe. sav) containing joint selection probability because the sampling plan uses the method of proportional to scale (PPS). After the sample is selected, additional variables corresponding to voters' demographic information and their opinions on submitting bills will be collected and added to the data file.

property _ assesse . sav

This hypothetical data file involves the measures taken by an asset appraiser in a county to constantly update the asset value appraisal with limited resources. This case corresponds to the assets sold by the county in the past year. Each case in the data file records the town where the asset is located, the appraiser who last appraised the asset, the time since the appraisal, the current appraisal and the selling price of the asset.

property _ assesse _ cs . sav

This hypothetical data file involves the measures taken by an asset appraiser to constantly update the asset value appraisal with limited resources in a certain state. This case corresponds to the assets of the state. Each case in the data file records the county, town and district where the assets are located, the time since the last assessment and the valuation at that time.

Attribute _ evaluation _ cs _ sample. sav

This hypothetical data file contains asset samples listed in property _ assessment _ cs.sav Samples are selected according to the design specified in property _ assessment. Csplan, data file records contain probability and sample weight. After selecting samples, additional variable current values will be collected and added to the data file.

Recidivism. sav

This hypothetical data file involves the initiatives of government law enforcement agencies in understanding the recidivism rate within their jurisdiction. Each case corresponds to an ex-convict, whose demographic information and details of the first crime are recorded; If you are arrested for the second time within two years after your first arrest, the time between arrests will also be recorded.

Recidivism _cs_sample.sav

This hypothetical data file involves the initiatives of government law enforcement agencies in understanding the recidivism rate within their jurisdiction. Each case corresponds to a former criminal who was arrested and released for the first time in June 2003, and recorded his demographic information, details of his first crime and data of his second arrest (if it happened before the end of June 2006). According to the sampling scheme specified in recidivist _cs.csplan, criminals are selected from the sampling department; The plan uses the method of proportional to size (PPS), so there is also a file (recidivism _ cs _ jointprobe.sav) containing the joint selection probability.

rfm_transactions.sav

This hypothetical data file contains purchase transaction data, that is, the purchase date, the goods purchased and the consumption amount of each transaction.

salesperformance.sav

This is a hypothetical data file about evaluating two new sales training courses. 60 employees were divided into 3 groups and all received standard training. In addition, the second group received technical training; The third group received practical guidance. At the end of the training course, every employee will be tested and their scores will be recorded. Each case in the data file represents a student and records the group to which he is assigned and the score of the test.

satisf.sav

This hypothetical data file involves a satisfaction survey conducted by a retail company in four stores. A total of 582 customers were surveyed, and each case represented a customer's answer.

Screw.sav

This data file contains information about the features of screws, bolts, nuts and pushpins 14.

Shampoo _ph.sav

This is a hypothetical data file about the quality control of a hairdressing product factory. Test six batches of independent products at specified time intervals and record their pH values. The target range is 4.5–5.5.

ships.sav

& lt