ValueError: could not convert string to float: 'sepal_length'

Question

Hi. Ken's code executes perfectly, while my code returns this error:

from itertools import groupby

import csv
import matplotlib.pyplot as plt

input_file = "data/iris.csv"

with open(input_file, 'r') as iris_data:
    irises = list(csv.reader(iris_data))

colors = {"Iris-setosa": "#2B5B84", "Iris-versicolor": "g", "Iris-virginica": "purple"}
irises.pop()  # because the list includes an extra unneeded item

for species, group in groupby(irises, lambda i: i[4]):
    import pdb; pdb.set_trace()
    categorized_irises = list(group)
    sepal_lengths = [float(iris[0]) for iris in categorized_irises]
    sepal_widths = [float(iris[1]) for iris in categorized_irises]
    plt.scatter(sepal_lengths, sepal_widths, s=10, c=colors[species], label=species)  # marker size of 10,

-------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-2-106afecb7f6d> in <module>()
     16 
     17     categorized_irises = list(group)
---> 18     sepal_lengths = [float(iris[0]) for iris in categorized_irises]
     19     sepal_widths = [float(iris[1]) for iris in categorized_irises]
     20     plt.scatter(sepal_lengths,sepal_widths,s=10,c=colors[species],label=species)

ValueError: could not convert string to float: 'sepal_length'

For reference, there's a similar thread, but unfortunately the responses there did not solve my error.

Thanks in advance to anyone who can help!

Answer 1 · 2019-01-17T18:18:59Z

Hi again!

5.1,3.5,1.4,0.2,Iris-setosa
4.9,3.0,1.4,0.2,Iris-setosa
4.7,3.2,1.3,0.2,Iris-setosa
4.6,3.1,1.5,0.2,Iris-setosa

Those are the first 4 lines of my CSV file, with a comma as the column separator. "Iris-setosa" is at index 4, so it sits in the 5th column of the CSV. Does your file look the same?

I would recommend doing the following:

just after

with open(input_file, 'r') as iris_data:
    irises = list(csv.reader(iris_data))

I would check what irises contains by printing it row by row:

for i in irises:
    print(i)
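
A slightly more targeted version of that check, as a rough sketch: instead of printing every row, list only the rows whose first field float() rejects. The in-memory sample stands in for data/iris.csv. The header names other than 'sepal_length' (the one shown in the error message) are assumptions, and so is the helper name is_number.

```python
import csv
import io

# Assumed stand-in for data/iris.csv; only 'sepal_length' is known from the error.
sample = io.StringIO(
    "sepal_length,sepal_width,petal_length,petal_width,species\n"
    "5.1,3.5,1.4,0.2,Iris-setosa\n"
    "4.9,3.0,1.4,0.2,Iris-setosa\n"
)

irises = list(csv.reader(sample))


def is_number(text):
    """Return True if float() accepts the string (hypothetical helper)."""
    try:
        float(text)
        return True
    except ValueError:
        return False


# Rows that are empty or whose first field is not numeric are the ones
# that would break float(iris[0]) inside the plotting loop.
bad_rows = [
    (index, row)
    for index, row in enumerate(irises)
    if not row or not is_number(row[0])
]

for index, row in bad_rows:
    print(index, row)
```

If the only row reported is index 0, the file most likely starts with a header row that needs to be skipped before converting to float.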

Answer 2 · 2019-01-17T13:13:32Z

Have you checked the CSV file and its structure? Does it have an unnecessary last field? Maybe the separator is different? Have you tried looping through the irises list to check that it runs without a problem and returns all the lines with the data in the correct order?
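
Those checks could be sketched roughly like this, using the standard library's csv.Sniffer to guess the separator and a Counter to tally how many fields each row has. The sample text is an assumed stand-in for the real file (only 'sepal_length' is known from the error message), including the blank last line.

```python
import csv
import io
from collections import Counter

# Assumed stand-in for the contents of iris.csv, with a trailing blank line.
text = (
    "sepal_length,sepal_width,petal_length,petal_width,species\n"
    "5.1,3.5,1.4,0.2,Iris-setosa\n"
    "4.9,3.0,1.4,0.2,Iris-setosa\n"
    "\n"
)

# Let Sniffer pick the separator from a few likely candidates.
dialect = csv.Sniffer().sniff(text, delimiters=",;\t")
print("delimiter:", repr(dialect.delimiter))

# Count fields per row; anything other than 5 points at a malformed or blank row.
rows = list(csv.reader(io.StringIO(text), dialect))
field_counts = Counter(len(row) for row in rows)
print("fields per row:", dict(field_counts))
```

A count of 0 fields corresponds to a blank line, which would explain an extra unneeded item at the end of the list.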

Answer 3 · 2019-01-16T03:26:14Z

Hi Mark,

I have seen your other thread as well. The code below works fine in my local environment (Jupyter notebook on Anaconda 1.8.7). The input_file variable will of course differ depending on where you store the iris.csv file.

import csv
import matplotlib.pyplot as plt
from itertools import groupby 

input_file = "/Users/mustafabasaran/Desktop/iris.csv"

with open(input_file, 'r') as iris_data:
    irises = list(csv.reader(iris_data))

colors = {"Iris-setosa": "#2B5B84", "Iris-versicolor": "g", "Iris-virginica": "purple"}


irises.pop()  # remove the extra unneeded last item

for species, group in groupby(irises, lambda i: i[4]):
    categorized_irises = list(group)
    sepal_lengths = [float(iris[0]) for iris in categorized_irises]
    sepal_widths = [float(iris[1]) for iris in categorized_irises]
    plt.scatter(sepal_lengths, sepal_widths, s=10, c=colors[species], label=species)

plt.title("Iris Data Set", fontsize=12)
plt.xlabel("sepal length (cm)",fontsize=10)
plt.ylabel("sepal width (cm)",fontsize=10)
plt.legend(loc="upper right")
plt.show()

I hope this helps.

Answer 4 · 2019-01-20T22:56:46Z

"I tried removing the header row, but of course, it's needed the way Ken writes the loop"

Why do you think the header row is needed?
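
As far as I can tell, the loop only uses columns 0, 1 and 4 of each data row, so it should not depend on the header. Here is a minimal sketch of skipping it, assuming the file starts with a header row and may end with a blank line. The sample data and the name sepal_lengths_by_species are made up for illustration, and the plotting call is left out to keep it short.

```python
import csv
import io
from itertools import groupby

# Assumed stand-in for iris.csv: header row, a few data rows, trailing blank line.
sample = io.StringIO(
    "sepal_length,sepal_width,petal_length,petal_width,species\n"
    "5.1,3.5,1.4,0.2,Iris-setosa\n"
    "4.9,3.0,1.4,0.2,Iris-setosa\n"
    "7.0,3.2,4.7,1.4,Iris-versicolor\n"
    "\n"
)

reader = csv.reader(sample)
next(reader, None)  # discard the header row (assumes the file has one)
irises = [row for row in reader if row]  # drop blank rows instead of calling pop()

# groupby only merges adjacent rows, so this relies on the file being
# ordered by species, as in the original loop.
sepal_lengths_by_species = {}
for species, group in groupby(irises, lambda iris: iris[4]):
    sepal_lengths_by_species[species] = [float(iris[0]) for iris in group]

print(sepal_lengths_by_species)
```

With the header gone, float(iris[0]) only ever sees numeric strings, so the ValueError from the question should not occur.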
