📅 2026-02-13 17:59:38 ⏱ 0.06 s pysuricata v0.0.14

Summary

Rows
891
Columns
12
Processed bytes (≈)
121.4 KB
Missing
866 (8.1%)
Duplicates (≈)
0 (0.0%)
Column types
  • Numeric3
  • Categorical9
  • Datetime0
  • Boolean0
Top missing columns
  • Cabin 687 (77.1%)
  • Age 177 (19.9%)
  • Embarked 2 (0.2%)
Quick insights
  • Unique cols: 12
  • Constant cols: 0
  • High-card categoricals: 2
  • Date range: — → —
  • Text cols: 9 (avg len 5.2)
Description
Click to add description...

Sample

Show sample
PassengerIdSurvivedPclassNameSexAgeSibSpParchTicketFareCabinEmbarked
49549603Yousseff, Mr. Geriousmalenan00262714.4583nanC
64864903Willey, Mr. Edwardmalenan00S.O./P.P. 7517.55nanS
27827903Rice, Master. Ericmale7.04138265229.125nanQ
313211Spencer, Mrs. William Augustus (Marie Eugenie)femalenan10PC 17569146.5208B78C
25525613Touma, Mrs. Darwis (Hanne Youssef Razi)female29.002265015.2458nanC
29829911Saalfeld, Mr. Adolphemalenan001998830.5C106S
60961011Shutes, Miss. Elizabeth Wfemale40.000PC 17582153.4625C125S
31831911Wick, Miss. Mary Nataliefemale31.00236928164.8667C7S
48448511Bishop, Mr. Dickinson Hmale25.0101196791.0792B49C
36736813Moussa, Mrs. (Mantoura Boulos)femalenan0026267.2292nanC

Showing 10 randomly sampled rows from the first chunk.


Variables

Analyzing 12 variables (3 numeric, 9 categorical, 0 datetime, 0 boolean).

Showing 1-10 of 12
PassengerId Numeric int64
  • Positive‑only
Count891
Unique891
Missing0 (0.0%)
Outliers0 (0.0%)
Zeros0 (0.0%)
Infinites0 (0.0%)
Negatives0 (0.0%)
Min1
Q1 (P25)223.5
Median446
Mean446
Q3 (P75)668.5
Max891
Processed bytes (≈)7.0 KB
200 400 600 800 0 20 40 60 80 100 PassengerId
1 0 100 200 300 400 500 PassengerId (log scale)
200 400 600 800 0 10 20 30 40 PassengerId
0.1 1 0 100 200 300 PassengerId (log scale)
200 400 600 800 0 5 10 15 20 PassengerId
0.1 1 0 50 100 150 PassengerId (log scale)
Scale:
Bins:
Survived Categorical int64 approx
  • Case variants
  • Trim variants
  • Empty strings
Count891
Unique (≈)2
Missing0 (0.0%)
Mode0
Mode %61.6%
Empty strings549
Entropy0.9607
Rare levels0 (0.0%)
Top 5 coverage100.0%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
0 549 rows (61.6%)0549 (61.6%)1 342 rows (38.4%)1342 (38.4%)
Top‑N:
Pclass Categorical int64 approx
  • Case variants
  • Trim variants
Count891
Unique (≈)3
Missing0 (0.0%)
Mode3
Mode %55.1%
Empty strings0
Entropy1.439
Rare levels0 (0.0%)
Top 5 coverage100.0%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
3 491 rows (55.1%)3491 (55.1%)1 216 rows (24.2%)1216 (24.2%)2 184 rows (20.7%)2184 (20.7%)
Top‑N:
Name Categorical object approx
  • High cardinality
  • Case variants
  • Trim variants
Count891
Unique (≈)892
Missing0 (0.0%)
ModeAllison, Miss. Helen Loraine
Mode %0.1%
Empty strings0
Entropy0.264
Rare levels24 (2.7%)
Top 5 coverage0.6%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
Allison, Miss. Helen Loraine 1 rows (0.1%)Allison, Miss. Helen Lor…1 (0.1%)Barber, Miss. Ellen "Nellie" 1 rows (0.1%)Barber, Miss. Ellen &quo…1 (0.1%)Baxter, Mrs. James (Helene DeLaudeniere Chaput) 1 rows (0.1%)Baxter, Mrs. James (Hele…1 (0.1%)Bishop, Mrs. Dickinson H (Helen Walton) 1 rows (0.1%)Bishop, Mrs. Dickinson H…1 (0.1%)Other 20 rows (2.2%)Other20 (2.2%)
Allison, Miss. Helen Loraine 1 rows (0.1%)Allison, Miss. Helen Lor…1 (0.1%)Barber, Miss. Ellen "Nellie" 1 rows (0.1%)Barber, Miss. Ellen &quo…1 (0.1%)Baxter, Mrs. James (Helene DeLaudeniere Chaput) 1 rows (0.1%)Baxter, Mrs. James (Hele…1 (0.1%)Bishop, Mrs. Dickinson H (Helen Walton) 1 rows (0.1%)Bishop, Mrs. Dickinson H…1 (0.1%)Connolly, Miss. Kate 1 rows (0.1%)Connolly, Miss. Kate1 (0.1%)Dooley, Mr. Patrick 1 rows (0.1%)Dooley, Mr. Patrick1 (0.1%)Dorking, Mr. Edward Arthur 1 rows (0.1%)Dorking, Mr. Edward Arth…1 (0.1%)Haas, Miss. Aloisia 1 rows (0.1%)Haas, Miss. Aloisia1 (0.1%)Hanna, Mr. Mansour 1 rows (0.1%)Hanna, Mr. Mansour1 (0.1%)Other 15 rows (1.7%)Other15 (1.7%)
Allison, Miss. Helen Loraine 1 rows (0.1%)Allison, Miss. Helen Lor…1 (0.1%)Barber, Miss. Ellen "Nellie" 1 rows (0.1%)Barber, Miss. Ellen &quo…1 (0.1%)Baxter, Mrs. James (Helene DeLaudeniere Chaput) 1 rows (0.1%)Baxter, Mrs. James (Hele…1 (0.1%)Bishop, Mrs. Dickinson H (Helen Walton) 1 rows (0.1%)Bishop, Mrs. Dickinson H…1 (0.1%)Connolly, Miss. Kate 1 rows (0.1%)Connolly, Miss. Kate1 (0.1%)Dooley, Mr. Patrick 1 rows (0.1%)Dooley, Mr. Patrick1 (0.1%)Dorking, Mr. Edward Arthur 1 rows (0.1%)Dorking, Mr. Edward Arth…1 (0.1%)Haas, Miss. Aloisia 1 rows (0.1%)Haas, Miss. Aloisia1 (0.1%)Hanna, Mr. Mansour 1 rows (0.1%)Hanna, Mr. Mansour1 (0.1%)Hosono, Mr. Masabumi 1 rows (0.1%)Hosono, Mr. Masabumi1 (0.1%)Johnson, Mr. William Cahoone Jr 1 rows (0.1%)Johnson, Mr. William Cah…1 (0.1%)Keane, Miss. Nora A 1 rows (0.1%)Keane, Miss. Nora A1 (0.1%)Kelly, Miss. Anna Katherine "Annie Kate" 1 rows (0.1%)Kelly, Miss. Anna Kather…1 (0.1%)Levy, Mr. Rene Jacques 1 rows (0.1%)Levy, Mr. Rene Jacques1 (0.1%)Other 10 rows (1.1%)Other10 (1.1%)
Top‑N:
Sex Categorical object approx
  • Case variants
  • Trim variants
Count891
Unique (≈)2
Missing0 (0.0%)
Modemale
Mode %64.8%
Empty strings0
Entropy0.9362
Rare levels0 (0.0%)
Top 5 coverage100.0%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
male 577 rows (64.8%)male577 (64.8%)female 314 rows (35.2%)female314 (35.2%)
Top‑N:
Age Numeric float64
  • Missing
  • Positive‑only
  • Many outliers
Count714
Unique88
Missing177 (19.9%)
Outliers11 (1.5%)
Zeros0 (0.0%)
Infinites0 (0.0%)
Negatives0 (0.0%)
Min0.42
Q1 (P25)20.12
Median28
Mean29.7
Q3 (P75)38
Max80
Processed bytes (≈)7.0 KB
20 40 60 0 50 100 150 200 Age
0.1 1 0 100 200 300 Age (log scale)
20 40 60 0 20 40 60 80 100 Age
0.01 0.1 1 0 50 100 150 Age (log scale)
20 40 60 0 10 20 30 40 50 Age
0.01 0.1 1 0 20 40 60 80 Age (log scale)
Scale:
Bins:
SibSp Categorical int64 approx
  • Case variants
  • Trim variants
  • Empty strings
Count891
Unique (≈)7
Missing0 (0.0%)
Mode0
Mode %68.2%
Empty strings608
Entropy1.339
Rare levels2 (1.3%)
Top 5 coverage98.7%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
0 608 rows (68.2%)0608 (68.2%)1 209 rows (23.5%)1209 (23.5%)2 28 rows (3.1%)228 (3.1%)4 18 rows (2.0%)418 (2.0%)Other 28 rows (3.1%)Other28 (3.1%)
0 608 rows (68.2%)0608 (68.2%)1 209 rows (23.5%)1209 (23.5%)2 28 rows (3.1%)228 (3.1%)4 18 rows (2.0%)418 (2.0%)3 16 rows (1.8%)316 (1.8%)8 7 rows (0.8%)87 (0.8%)5 5 rows (0.6%)55 (0.6%)
Top‑N:
Parch Categorical int64 approx
  • Dominant category
  • Case variants
  • Trim variants
  • Empty strings
Count891
Unique (≈)7
Missing0 (0.0%)
Mode0
Mode %76.1%
Empty strings678
Entropy1.128
Rare levels4 (1.7%)
Top 5 coverage99.4%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
0 678 rows (76.1%)0678 (76.1%)1 118 rows (13.2%)1118 (13.2%)2 80 rows (9.0%)280 (9.0%)3 5 rows (0.6%)35 (0.6%)Other 10 rows (1.1%)Other10 (1.1%)
0 678 rows (76.1%)0678 (76.1%)1 118 rows (13.2%)1118 (13.2%)2 80 rows (9.0%)280 (9.0%)3 5 rows (0.6%)35 (0.6%)5 5 rows (0.6%)55 (0.6%)4 4 rows (0.4%)44 (0.4%)6 1 rows (0.1%)61 (0.1%)
Top‑N:
Ticket Categorical object approx
  • High cardinality
  • Case variants
  • Trim variants
Count891
Unique (≈)716
Missing0 (0.0%)
Mode113043
Mode %0.1%
Empty strings0
Entropy0.264
Rare levels24 (2.7%)
Top 5 coverage0.6%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
113043 1 rows (0.1%)1130431 (0.1%)113510 1 rows (0.1%)1135101 (0.1%)19988 1 rows (0.1%)199881 (0.1%)226593 1 rows (0.1%)2265931 (0.1%)Other 20 rows (2.2%)Other20 (2.2%)
113043 1 rows (0.1%)1130431 (0.1%)113510 1 rows (0.1%)1135101 (0.1%)19988 1 rows (0.1%)199881 (0.1%)226593 1 rows (0.1%)2265931 (0.1%)234818 1 rows (0.1%)2348181 (0.1%)250651 1 rows (0.1%)2506511 (0.1%)2647 1 rows (0.1%)26471 (0.1%)2693 1 rows (0.1%)26931 (0.1%)2695 1 rows (0.1%)26951 (0.1%)Other 15 rows (1.7%)Other15 (1.7%)
113043 1 rows (0.1%)1130431 (0.1%)113510 1 rows (0.1%)1135101 (0.1%)19988 1 rows (0.1%)199881 (0.1%)226593 1 rows (0.1%)2265931 (0.1%)234818 1 rows (0.1%)2348181 (0.1%)250651 1 rows (0.1%)2506511 (0.1%)2647 1 rows (0.1%)26471 (0.1%)2693 1 rows (0.1%)26931 (0.1%)2695 1 rows (0.1%)26951 (0.1%)28551 1 rows (0.1%)285511 (0.1%)29011 1 rows (0.1%)290111 (0.1%)315088 1 rows (0.1%)3150881 (0.1%)345364 1 rows (0.1%)3453641 (0.1%)345783 1 rows (0.1%)3457831 (0.1%)Other 10 rows (1.1%)Other10 (1.1%)
Top‑N:
Fare Numeric float64
  • Skewed Right
  • Heavy‑tailed
  • Heaping
  • Many outliers
Count891
Unique821
Missing0 (0.0%)
Outliers116 (13.0%)
Zeros15 (1.7%)
Infinites0 (0.0%)
Negatives0 (0.0%)
Min0
Q1 (P25)7.91
Median14.45
Mean32.2
Q3 (P75)31
Max512.3
Processed bytes (≈)7.0 KB
100 200 300 400 0 200 400 600 800 Fare
0 20 40 60 80 100 Fare (log scale)
100 200 300 400 500 0 200 400 600 Fare
0 10 20 30 40 Fare (log scale)
200 400 0 100 200 300 Fare
0 5 10 15 20 Fare (log scale)
Scale:
Bins:
Cabin Categorical object approx
  • Case variants
  • Trim variants
  • Missing
Count204
Unique (≈)148
Missing687 (77.1%)
Mode
Mode %0.0%
Empty strings0
EntropyNaN
Rare levels0 (0.0%)
Top 5 coverage0.0%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
Top‑N:
Embarked Categorical object approx
  • Dominant category
  • Case variants
  • Trim variants
  • Missing
Count889
Unique (≈)3
Missing2 (0.2%)
ModeS
Mode %72.4%
Empty strings0
Entropy1.097
Rare levels0 (0.0%)
Top 5 coverage100.0%
Label length (avg)NaN
Length p90
Processed bytes (≈)0.0 B
S 644 rows (72.3%)S644 (72.3%)C 168 rows (18.9%)C168 (18.9%)Q 77 rows (8.6%)Q77 (8.6%)
Top‑N:

Correlations

📊

No significant correlations found (threshold: 0.50)


Missing Values

3 columns with missing values
Cabin
Age
Embarked
Missing per chunk:
≤5%
5-20%
>20%
Cabin
Age
Embarked