Error in the Loading Raw Data video

Question

This is the error I have:

python s2v1.py                                                                                                             
['', 'id', 'priceLabel', 'name', 'brandId', 'brandName', 'imageLink', 'desc', 'vendor', 'print', 'material']                                      
Traceback (most recent call last):                                                                                                                
  File "/usr/local/pyenv/versions/3.5.0/lib/python3.5/site-packages/numpy/lib/_iotools.py", line 855, in easy_dtype                               
    ndtype = np.dtype(ndtype)                                                                                                                     
TypeError: data type not understood

During handling of the above exception, another exception occurred:

Traceback (most recent call last):                                                                                                                
  File "s2v1.py", line 25, in <module>                                                                                                            
    my_csv= load_data('data.csv')                                                                                                                 
  File "s2v1.py", line 22, in load_data                                                                                                           
    my_csv = numpy.genfromtxt(filename, delimiter=d, skip_header=1, invalid_raise=False, names = 'FIELDNAMES', dtype=DATATYPES)                   
  File "/usr/local/pyenv/versions/3.5.0/lib/python3.5/site-packages/numpy/lib/npyio.py", line 1433, in genfromtxt      
 dtype = easy_dtype(dtype, defaultfmt=defaultfmt, names=names)                                                                                 
  File "/usr/local/pyenv/versions/3.5.0/lib/python3.5/site-packages/numpy/lib/_iotools.py", line 864, in easy_dtype                               
    ndtype = np.dtype(dict(formats=ndtype, names=names))                                                                                          
TypeError: data type "myint" not understood

This is my code:

import csv
import numpy

def open_with_csv(filename, d='\t'):
    data = []
    with open(filename, encoding='utf-8') as tsvin:
        tie_reader = csv.reader(tsvin, delimiter='\t')
        for line in tie_reader:
            data.append(line)
    return data

data_from_csv = open_with_csv('data.csv')
print(data_from_csv[0])


FIELDNAMES = ['', 'id', 'priceLabel', 'name', 'brandId', 'brandName', 'imageLink', 'desc', 'vendor', 'patterned', 'material']

DATATYPES = [('myint', 'i'), ('myid' 'i'), ('price', 'f8'), ('name', 'a200'), ('brandID', '<i8'), ('brandName', 'a200'), ('imageUrl', '|s500'), ('description', '|s900'), ('vendor', '|S100'), ('pattern', '|S500'), ('material', '|S50'), ]


def load_data(filename, d='\t'):
    my_csv = numpy.genfromtxt(filename, delimiter=d, skip_header=1, invalid_raise=False, names = 'FIELDNAMES', dtype=DATATYPES)
    return my_csv

my_csv= load_data('data.csv')

[MOD: added ```python markdown formatting -cf]

Answer 1 · 2016-04-13T23:45:52Z

April 13, 2016 11:45pm

It should be names=FIELDNAMES not names = 'FIELDNAMES':

def load_data(filename, d='\t'):
    my_csv = numpy.genfromtxt(
        filename, delimiter=d, skip_header=1, invalid_raise=False,
        names=FIELDNAMES, dtype=DATATYPES)
    return my_csv

Welcome to the Treehouse Community

Looking to learn something new?

Jesse DeGrendel

Jesse DeGrendel

Error in the Loading Raw Data video

1 Answer

Iain Simmons

Iain Simmons