<

Data File Description: data/titanic_first_class.csv

Overall Validation Status: ERROR
  • Locate Data: SUCCESS
    • Sucessfully Located Data File: data/titanic_first_class.csv
  • File Validation: SUCCESS
    • Schema File "schemas/titanic_schema.json" assigned to "data/titanic_first_class.csv"
  • Data Validation: ERROR
    • The cell "Female" in row at position "27" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "None" in row at position "32" and field "Age" at position "5" does not conform to a constraint: constraint "required" is "True"
    • The cell "M" in row at position "37" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Mrs. James Joseph (Margaret Tobin) Brown. From Wikipedia: Margaret Brown (née Tobin; July 18, 1867 – October 26, 1932), posthumously known as "The Unsinkable Molly Brown", was an American socialite and philanthropist. She was a passenger on the RMS Titanic which sank in 1912 and she unsuccessfully urged the crew in Lifeboat No. 6 to return to the debris field to look for survivors. During her lifetime, her friends called her "Maggie", but by her death, obituaries referred to her as the "Unsinkable Mrs. Brown". Gene Fowler referred to her as "Molly Brown" in his 1933 book Timberline. The following year, she was referred to as "The Unsinkable Mrs. Brown" and "Molly Brown" in newspapers. " in row at position "39" and field "Name" at position "3" does not conform to a constraint: constraint "maxLength" is "100"
    • The cell "Mrs. James Joseph (Margaret Tobin) Brown. From Wikipedia: Margaret Brown (née Tobin; July 18, 1867 – October 26, 1932), posthumously known as "The Unsinkable Molly Brown", was an American socialite and philanthropist. She was a passenger on the RMS Titanic which sank in 1912 and she unsuccessfully urged the crew in Lifeboat No. 6 to return to the debris field to look for survivors. During her lifetime, her friends called her "Maggie", but by her death, obituaries referred to her as the "Unsinkable Mrs. Brown". Gene Fowler referred to her as "Molly Brown" in his 1933 book Timberline. The following year, she was referred to as "The Unsinkable Mrs. Brown" and "Molly Brown" in newspapers. " in row at position "39" and field "Name" at position "3" does not conform to a constraint: constraint "pattern" is "^[A-Z][a-z]+\. (.*)$"
    • The cell "F" in row at position "42" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "45" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "48" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "50" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "54" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "58" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "72" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "73" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "77" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "3355.0" in row at position "81" and field "Fare" at position "8" does not conform to a constraint: constraint "maximum" is "1000"
    • The cell "Male" in row at position "99" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "104" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "106" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "107" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "111" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "112" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "M" in row at position "114" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "M" in row at position "119" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "M" in row at position "126" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "M" in row at position "127" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "129" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • Type error in the cell "unknown" in row "129" and field "Age" at position "5": type is "number/default"
    • The cell "Female" in row at position "130" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "131" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "M" in row at position "134" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "136" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "138" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "141" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "F" in row at position "143" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "144" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "146" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • Type error in the cell "10" in row "151" and field "Survived" at position "1": type is "boolean/default"
    • The cell "Male" in row at position "151" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "154" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "161" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "F" in row at position "171" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "F" in row at position "174" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Male" in row at position "176" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "None" in row at position "179" and field "Name" at position "3" does not conform to a constraint: constraint "required" is "True"
    • Row at position "184" has unique constraint violation in field "Name" at position "3": the same as in the row at position 57
    • The cell "-71.0" in row at position "185" and field "Fare" at position "8" does not conform to a constraint: constraint "minimum" is "0"
    • The cell "the Countess. of (Lucy Noel Martha Dyer-Edwards) Rothes" in row at position "187" and field "Name" at position "3" does not conform to a constraint: constraint "pattern" is "^[A-Z][a-z]+\. (.*)$"
    • The cell "F" in row at position "188" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "189" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "191" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "F" in row at position "201" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Female" in row at position "204" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
    • The cell "Washington Augustus Roebling, Mr." in row at position "212" and field "Name" at position "3" does not conform to a constraint: constraint "pattern" is "^[A-Z][a-z]+\. (.*)$"
    • The cell "Female" in row at position "216" and field "Sex" at position "4" does not conform to a constraint: constraint "enum" is "['male', 'female']"
  • Data Quality: WARNING
    • Column "Pclass" contains only one value
Schema: schemas/titanic_schema.json
File Description:
  • # of Fields: 8
  • # of Observations: 216

File Metadata:
  • Format: csv
  • Source: local
  • Last Modified: 2023-06-15T09:41:38.904023
  • File Size: 12528 bytes

Survived

Data Type:

Data Type:

"b" "o" "o" "l" "e" and 2 others...

Count:

215

Most Frequent:

"T" "r" "u" "e"

True/False Ratio:

0.6279

Missing:

1

Percent Missing:

0.0046

Pclass

Data Type:

Data Type:

"c" "a" "t" "e" "g" and 6 others...

Count:

216

Missing:

0

Percent Missing:

0.0000

Unique:

1

Unique Ratio:

0.0046

Most Common Value:

"1"

Most Common Value Count:

216

Most Common Value Ratio:

1.0000

Least Common Value:

"1"

Least Common Value Count:

216

Least Common Value Ratio:

1.0000

Name

Data Type:

Data Type:

"t" "e" "x" "t"

Count:

216

Unique:

214

Percent Unique:

0.9907

Missing:

1

Percent Missing:

0.0046

Most Frequent Characters:

"('r', 603)" "('e', 549)" "('a', 437)" "('i', 365)" "('s', 353)"

Most Frequent Numbers:

"('1', 6)" "('6', 3)" "('2', 3)" "('9', 3)" "('3', 3)"

Most Frequent Punctuation:

"('.', 225)" "('(', 45)" "(')', 45)" "('"', 12)" "(',', 9)"

Most Frequent Words:

"('Mr', 105)" "('Miss', 47)" "('Mrs', 44)" "('William', 19)" "('John', 13)"

Average Word Length:

5.3596

Standard Deviation Word Length:

2.4623

Average Sentence Length:

31.4047

Standard Deviation Sentence Length:

47.0322

Sex

Data Type:

Data Type:

"c" "a" "t" "e" "g" and 6 others...

Count:

216

Missing:

0

Percent Missing:

0.0000

Unique:

6

Unique Ratio:

0.0278

Most Common Value:

"m" "a" "l" "e"

Most Common Value Count:

102

Most Common Value Ratio:

0.4722

Least Common Value:

"F"

Least Common Value Count:

6

Least Common Value Ratio:

0.0278

Age

Data Type:

Data Type:

"t" "e" "x" "t"

Count:

216

Unique:

60

Percent Unique:

0.2778

Missing:

1

Percent Missing:

0.0046

Most Frequent Characters:

"('0', 238)" "('4', 74)" "('3', 69)" "('2', 57)" "('5', 55)"

Most Frequent Numbers:

"('0', 238)" "('4', 74)" "('3', 69)" "('2', 57)" "('5', 55)"

Most Frequent Punctuation:

"('.', 214)"

Most Frequent Words:

"('350', 9)" "('360', 9)" "('300', 8)" "('380', 7)" "('400', 7)"

Average Word Length:

3.0093

Standard Deviation Word Length:

0.2892

Average Sentence Length:

4.0047

Standard Deviation Sentence Length:

0.2267

Siblings/Spouses Aboard

Data Type:

Data Type:

"t" "e" "x" "t"

Count:

216

Unique:

4

Percent Unique:

0.0185

Missing:

0

Percent Missing:

0.0000

Most Frequent Characters:

"('1', 71)" "('2', 5)" "('3', 3)"

Most Frequent Numbers:

"('1', 71)" "('2', 5)" "('3', 3)"

Most Frequent Punctuation:

"('-', 137)"

Most Frequent Words:

"('1', 71)" "('2', 5)" "('3', 3)"

Average Word Length:

1.0000

Standard Deviation Word Length:

0.0000

Average Sentence Length:

1.0000

Standard Deviation Sentence Length:

0.0000

Parents/Children Aboard

Data Type:

Data Type:

"i" "n" "t" "e" "g" and 2 others...

Count:

216

Mean:

0.3565

Standard Deviation:

0.6940

Minimum:

0

25th Percentile:

0.0000

Median:

0.0000

75th Percentile:

0.0000

Maximum:

4

Missing:

0

Percent Missing:

0.0000

Unique:

4

Percent Unique:

0.0185

Highest Precision:

1

Average Precision:

1.0000

Lowest Precision:

1

Fare

Data Type:

Data Type:

"i" "n" "t" "e" "g" and 2 others...

Count:

216

Mean:

98.8653

Standard Deviation:

236.1914

Minimum:

-71

25th Percentile:

30.6958

Median:

60.2875

75th Percentile:

96.7313

Maximum:

3355

Missing:

0

Percent Missing:

0.0000

Unique:

96

Percent Unique:

0.4444

Highest Precision:

4

Average Precision:

2.3889

Lowest Precision:

1

Correlations