Dataset info
Number of variables | 11 |
---|---|
Number of observations | 2410 |
Total Missing (%) | 4.0% |
Total size in memory | 225.9 KiB |
Average record size in memory | 96.0 B |
Variables types
Numeric | 5 |
---|---|
Categorical | 5 |
Date | 0 |
Text (Unique) | 0 |
Rejected | 1 |
Warnings
abv
has 62 / 2.6% missing values Missingcity
has a high cardinality: 384 distinct values Warningibu
has 1005 / 41.7% missing values Missingid_brewery
is highly correlated with brewery_id
(ρ = 1) Rejectedname_beer
has a high cardinality: 2305 distinct values Warningname_brewery
has a high cardinality: 551 distinct values Warningstate
has a high cardinality: 51 distinct values Warningstyle
has a high cardinality: 100 distinct values Warning abv
Numeric
Distinct count | 75 |
---|---|
Unique (%) | 3.2% |
Missing (%) | 2.6% |
Missing (n) | 62 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.059773 |
---|---|
Minimum | 0.001 |
Maximum | 0.128 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 0.001 |
---|---|
5-th percentile | 0.042 |
Q1 | 0.05 |
Median | 0.056 |
Q3 | 0.067 |
95-th percentile | 0.087 |
Maximum | 0.128 |
Range | 0.127 |
Interquartile range | 0.017 |
Descriptive statistics
Standard deviation | 0.013542 |
---|---|
Coef of variation | 0.22655 |
Kurtosis | 1.1449 |
Mean | 0.059773 |
MAD | 0.010585 |
Skewness | 0.95848 |
Sum | 140.35 |
Variance | 0.00018338 |
Memory size | 37.7 KiB |
Value | Count | Frequency (%) | |
0.05 | 215 | 8.9% | |
0.055 | 158 | 6.6% | |
0.06 | 125 | 5.2% | |
0.065 | 123 | 5.1% | |
0.052 | 107 | 4.4% | |
0.07 | 92 | 3.8% | |
0.045 | 89 | 3.7% | |
0.048 | 72 | 3.0% | |
0.058 | 66 | 2.7% | |
0.056 | 66 | 2.7% | |
Other values (64) | 1235 | 51.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
0.001 | 1 | 0.0% | |
0.027 | 2 | 0.1% | |
0.028 | 1 | 0.0% | |
0.032 | 3 | 0.1% | |
0.034 | 1 | 0.0% |
Maximum 5 values
Value | Count | Frequency (%) | |
0.1 | 1 | 0.0% | |
0.104 | 1 | 0.0% | |
0.12 | 1 | 0.0% | |
0.125 | 1 | 0.0% | |
0.128 | 1 | 0.0% |
brewery_id
Numeric
Distinct count | 558 |
---|---|
Unique (%) | 23.2% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 231.75 |
---|---|
Minimum | 0 |
Maximum | 557 |
Zeros (%) | 0.2% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 11.45 |
Q1 | 93 |
Median | 205 |
Q3 | 366 |
95-th percentile | 505.55 |
Maximum | 557 |
Range | 557 |
Interquartile range | 273 |
Descriptive statistics
Standard deviation | 157.69 |
---|---|
Coef of variation | 0.68041 |
Kurtosis | -1.0875 |
Mean | 231.75 |
MAD | 136.65 |
Skewness | 0.30797 |
Sum | 558517 |
Variance | 24865 |
Memory size | 37.7 KiB |
Value | Count | Frequency (%) | |
10 | 62 | 2.6% | |
25 | 38 | 1.6% | |
166 | 33 | 1.4% | |
141 | 25 | 1.0% | |
46 | 24 | 1.0% | |
80 | 23 | 1.0% | |
131 | 22 | 0.9% | |
165 | 20 | 0.8% | |
368 | 20 | 0.8% | |
107 | 19 | 0.8% | |
Other values (548) | 2124 | 88.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 6 | 0.2% | |
1 | 13 | 0.5% | |
2 | 5 | 0.2% | |
3 | 6 | 0.2% | |
4 | 4 | 0.2% |
Maximum 5 values
Value | Count | Frequency (%) | |
553 | 1 | 0.0% | |
554 | 1 | 0.0% | |
555 | 1 | 0.0% | |
556 | 4 | 0.2% | |
557 | 1 | 0.0% |
city
Categorical
Distinct count | 384 |
---|---|
Unique (%) | 15.9% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Grand Rapids | 66 |
---|---|
Portland | 64 |
Chicago | 55 |
Other values (381) |
Value | Count | Frequency (%) | |
Grand Rapids | 66 | 2.7% | |
Portland | 64 | 2.7% | |
Chicago | 55 | 2.3% | |
Indianapolis | 43 | 1.8% | |
San Diego | 42 | 1.7% | |
Boulder | 41 | 1.7% | |
Denver | 40 | 1.7% | |
Brooklyn | 38 | 1.6% | |
Seattle | 35 | 1.5% | |
Longmont | 33 | 1.4% | |
Other values (374) | 1953 | 81.0% |
ibu
Numeric
Distinct count | 108 |
---|---|
Unique (%) | 7.7% |
Missing (%) | 41.7% |
Missing (n) | 1005 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 42.713 |
---|---|
Minimum | 4 |
Maximum | 138 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 11 |
Q1 | 21 |
Median | 35 |
Q3 | 64 |
95-th percentile | 92 |
Maximum | 138 |
Range | 134 |
Interquartile range | 43 |
Descriptive statistics
Standard deviation | 25.954 |
---|---|
Coef of variation | 0.60764 |
Kurtosis | -0.13571 |
Mean | 42.713 |
MAD | 21.705 |
Skewness | 0.79252 |
Sum | 60012 |
Variance | 673.61 |
Memory size | 37.7 KiB |
Value | Count | Frequency (%) | |
20.0 | 82 | 3.4% | |
35.0 | 60 | 2.5% | |
65.0 | 54 | 2.2% | |
30.0 | 53 | 2.2% | |
70.0 | 48 | 2.0% | |
18.0 | 46 | 1.9% | |
25.0 | 45 | 1.9% | |
60.0 | 44 | 1.8% | |
40.0 | 41 | 1.7% | |
15.0 | 40 | 1.7% | |
Other values (97) | 892 | 37.0% | |
(Missing) | 1005 | 41.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
4.0 | 3 | 0.1% | |
5.0 | 5 | 0.2% | |
6.0 | 3 | 0.1% | |
7.0 | 7 | 0.3% | |
8.0 | 9 | 0.4% |
Maximum 5 values
Value | Count | Frequency (%) | |
120.0 | 3 | 0.1% | |
126.0 | 1 | 0.0% | |
130.0 | 1 | 0.0% | |
135.0 | 1 | 0.0% | |
138.0 | 1 | 0.0% |
id_beer
Numeric
Distinct count | 2410 |
---|---|
Unique (%) | 100.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 1431.1 |
---|---|
Minimum | 1 |
Maximum | 2692 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 135.25 |
Q1 | 808.25 |
Median | 1453.5 |
Q3 | 2075.8 |
95-th percentile | 2568.5 |
Maximum | 2692 |
Range | 2691 |
Interquartile range | 1267.5 |
Descriptive statistics
Standard deviation | 752.46 |
---|---|
Coef of variation | 0.52579 |
Kurtosis | -1.087 |
Mean | 1431.1 |
MAD | 644.87 |
Skewness | -0.12125 |
Sum | 3448983 |
Variance | 566200 |
Memory size | 37.7 KiB |
Value | Count | Frequency (%) | |
2047 | 1 | 0.0% | |
1226 | 1 | 0.0% | |
1222 | 1 | 0.0% | |
1220 | 1 | 0.0% | |
1218 | 1 | 0.0% | |
1214 | 1 | 0.0% | |
1212 | 1 | 0.0% | |
1210 | 1 | 0.0% | |
1208 | 1 | 0.0% | |
1206 | 1 | 0.0% | |
Other values (2400) | 2400 | 99.6% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 1 | 0.0% | |
4 | 1 | 0.0% | |
5 | 1 | 0.0% | |
6 | 1 | 0.0% | |
7 | 1 | 0.0% |
Maximum 5 values
Value | Count | Frequency (%) | |
2688 | 1 | 0.0% | |
2689 | 1 | 0.0% | |
2690 | 1 | 0.0% | |
2691 | 1 | 0.0% | |
2692 | 1 | 0.0% |
id_brewery
Highly correlated
This variable is highly correlated with brewery_id
and should be ignored for analysis
Correlation | 1 |
---|
name_beer
Categorical
Distinct count | 2305 |
---|---|
Unique (%) | 95.6% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Nonstop Hef Hop | 12 |
---|---|
Oktoberfest | 6 |
Dale's Pale Ale | 6 |
Other values (2302) |
Value | Count | Frequency (%) | |
Nonstop Hef Hop | 12 | 0.5% | |
Oktoberfest | 6 | 0.2% | |
Dale's Pale Ale | 6 | 0.2% | |
Longboard Island Lager | 4 | 0.2% | |
Boston Lager | 3 | 0.1% | |
Dagger Falls IPA | 3 | 0.1% | |
1327 Pod's ESB | 3 | 0.1% | |
312 Urban Pale Ale | 2 | 0.1% | |
White Zombie Ale | 2 | 0.1% | |
Watermelon Ale | 2 | 0.1% | |
Other values (2295) | 2367 | 98.2% |
name_brewery
Categorical
Distinct count | 551 |
---|---|
Unique (%) | 22.9% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Brewery Vivant | 62 |
---|---|
Oskar Blues Brewery | 46 |
Sun King Brewing Company | 38 |
Other values (548) |
Value | Count | Frequency (%) | |
Brewery Vivant | 62 | 2.6% | |
Oskar Blues Brewery | 46 | 1.9% | |
Sun King Brewing Company | 38 | 1.6% | |
Cigar City Brewing Company | 25 | 1.0% | |
Sixpoint Craft Ales | 24 | 1.0% | |
Hopworks Urban Brewery | 23 | 1.0% | |
Stevens Point Brewery | 22 | 0.9% | |
Great Crescent Brewery | 20 | 0.8% | |
21st Amendment Brewery | 20 | 0.8% | |
Bonfire Brewing Company | 19 | 0.8% | |
Other values (541) | 2111 | 87.6% |
ounces
Numeric
Distinct count | 7 |
---|---|
Unique (%) | 0.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 13.592 |
---|---|
Minimum | 8.4 |
Maximum | 32 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 8.4 |
---|---|
5-th percentile | 12 |
Q1 | 12 |
Median | 12 |
Q3 | 16 |
95-th percentile | 16 |
Maximum | 32 |
Range | 23.6 |
Interquartile range | 4 |
Descriptive statistics
Standard deviation | 2.3522 |
---|---|
Coef of variation | 0.17305 |
Kurtosis | 9.04 |
Mean | 13.592 |
MAD | 2.0194 |
Skewness | 2.0467 |
Sum | 32757 |
Variance | 5.5329 |
Memory size | 37.7 KiB |
Value | Count | Frequency (%) | |
12.0 | 1525 | 63.3% | |
16.0 | 841 | 34.9% | |
24.0 | 22 | 0.9% | |
19.2 | 15 | 0.6% | |
32.0 | 5 | 0.2% | |
16.9 | 1 | 0.0% | |
8.4 | 1 | 0.0% |
Minimum 5 values
Value | Count | Frequency (%) | |
8.4 | 1 | 0.0% | |
12.0 | 1525 | 63.3% | |
16.0 | 841 | 34.9% | |
16.9 | 1 | 0.0% | |
19.2 | 15 | 0.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
16.0 | 841 | 34.9% | |
16.9 | 1 | 0.0% | |
19.2 | 15 | 0.6% | |
24.0 | 22 | 0.9% | |
32.0 | 5 | 0.2% |
state
Categorical
Distinct count | 51 |
---|---|
Unique (%) | 2.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
CO | 265 |
---|---|
CA | 183 |
MI | 162 |
Other values (48) |
Value | Count | Frequency (%) | |
CO | 265 | 11.0% | |
CA | 183 | 7.6% | |
MI | 162 | 6.7% | |
IN | 139 | 5.8% | |
TX | 130 | 5.4% | |
OR | 125 | 5.2% | |
PA | 100 | 4.1% | |
IL | 91 | 3.8% | |
WI | 87 | 3.6% | |
MA | 82 | 3.4% | |
Other values (41) | 1046 | 43.4% |
style
Categorical
Distinct count | 100 |
---|---|
Unique (%) | 4.2% |
Missing (%) | 0.2% |
Missing (n) | 5 |
American IPA | |
---|---|
American Pale Ale (APA) | 245 |
American Amber / Red Ale | 133 |
Other values (96) |
Value | Count | Frequency (%) | |
American IPA | 424 | 17.6% | |
American Pale Ale (APA) | 245 | 10.2% | |
American Amber / Red Ale | 133 | 5.5% | |
American Blonde Ale | 108 | 4.5% | |
American Double / Imperial IPA | 105 | 4.4% | |
American Pale Wheat Ale | 97 | 4.0% | |
American Brown Ale | 70 | 2.9% | |
American Porter | 68 | 2.8% | |
Saison / Farmhouse Ale | 52 | 2.2% | |
Witbier | 51 | 2.1% | |
Other values (89) | 1052 | 43.7% |