在 Pandas 数据帧上使用布尔过滤器时出现 KeyError

本文介绍了在 Pandas 数据帧上使用布尔过滤器时出现 KeyError的处理方法,对大家解决问题具有一定的参考价值

问题描述

当一个数据帧的日期时间对象在另一个数据帧的日期时间对象范围内时,尝试合并两个数据帧.

Trying to combine two data frames when a datetime object from one dataframe is within a datetime object range in the other.

在我发布的第二个代码块中的这行代码中,不断出现:KeyError: 'cannot use a single bool to index into setitem'.

Keep getting: KeyError: 'cannot use a single bool to index into setitem' on this line of code in the second chunk I posted.

gametaxidf.loc[arrivemask, 'relevant'] = 1

我假设它也会在下一行使用类似的命令发生.

I'm assuming it would happen on the following line with a similar command as well.

这是给我带来麻烦的部分:

This is the part giving me trouble:

with open('/Users/benjaminprice/Desktop/TaxiCombined/Data/combinedtaxifiltered.csv', 'w') as csvfile: 
    fieldnames1 = ['index','pickup_datetime', 'dropoff_datetime', 'pickup_long', 'pickup_lat','dropoff_long','dropoff_lat','passenger_count','trip_distance','fare_amount','tip_amount','total_amount','stadium_code'] 
    writer = csv.DictWriter(csvfile, fieldnames=fieldnames1) 
    writer.writeheader()

for index, row in baseballdf.iterrows(): 
    gametimestart = row['Start.Time'] 
    gametimeend = row['End.Time'] 
    arrivemin = gametimestart - datetime.timedelta(minutes=120) 
    arrivemax = gametimeend - datetime.timedelta(minutes = 30) 
    departmin = gametimeend - datetime.timedelta(minutes = 60) 
    departmax = gametimeend + datetime.timedelta(minutes = 90)

    gametaxidf = combineddf[combineddf.DATE==row.DATE]
    gametaxidf['relevant']=0

    for index, row in gametaxidf.iterrows():
        arrivemask = (arrivemin < row['dropoff_datetime']) and (row['dropoff_datetime'] < arrivemax)
        departmask = (departmin < row['pickup_datetime']) and (row['pickup_datetime'] < departmax) 
        gametaxidf.loc[arrivemask, 'relevant'] = 1
        gametaxidf.loc[departmask, 'relevant'] = 1

        with open('/Users/benjaminprice/Desktop/TaxiCombined/Data/combinedtaxifiltered.csv','a') as combinedtaxi:
            gametaxidf.to_csv(combinedtaxi,header=None)
    print(str(index) + "done")

Gametaxidf.head(5):

Gametaxidf.head(5):

   index     pickup_datetime    dropoff_datetime  pickup_long  pickup_lat  
0    195 2014-04-01 00:08:13 2014-04-01 00:15:32   -73.922218   40.827557   
1    344 2014-04-01 00:16:30 2014-04-01 00:20:38   -73.846046   40.754566   
2    558 2014-04-01 00:28:59 2014-04-01 00:36:36   -73.921692   40.831394   
3    744 2014-04-01 00:42:00 2014-04-01 00:49:46   -73.938080   40.804646   
4    776 2014-04-01 00:43:54 2014-04-01 00:53:22   -73.952652   40.810577   

   dropoff_long  dropoff_lat  passenger_count  trip_distance  fare_amount  
0    -73.900620    40.856174                1           2.30          9.0   
1    -73.890259    40.753246                1           0.56          4.5   
2    -73.942719    40.823257                1           1.53          7.0   
3    -73.928490    40.830433                1           2.96         11.0   
4    -73.924332    40.827320                1           2.28         10.5   

   tip_amount  total_amount  stadium_code       DATE  relevant  
0           0          10.0           1.1 2014-04-01         0  
1           0           5.5           2.1 2014-04-01         0  
2           0           8.0           1.1 2014-04-01         0  
3           0          12.0           1.0 2014-04-01         0  
4           0          11.5           1.0 2014-04-01         0 

还收到此警告:正在尝试在 DataFrame 中的切片副本上设置值.

Also getting this warning: A value is trying to be set on a copy of a slice from a DataFrame.

Try using .loc[row_indexer,col_indexer] = value instead

但它让我继续经历……任何帮助都会很棒.

But it's letting me continue through that... any help would be great.

推荐答案

这里

gametaxidf.loc[arrivemask, 'relevant'] = 1

您正在尝试通过 .loc 运算符设置数据帧值.用于选择行的 Pandas 文档 说:

you're trying to set dataframe values by .loc operator. Pandas docs for selecting rows says:

.loc 主要是基于标签的,但也可以与布尔数组一起使用..loc 将在未找到项目时引发 KeyError.允许的输入是:

.loc is primarily label based, but may also be used with a boolean array. .loc will raise KeyError when the items are not found. Allowed inputs are:

  • 单个标签,例如5 或 'a',(注意 5 被解释为索引的标签.此用法不是沿索引的整数位置)
  • 标签列表或数组 ['a', 'b', 'c']
  • 带有标签'a':'f'的切片对象,(注意与通常的python切片相反,开始和结束都包括在内!)
  • 一个布尔数组

您正在尝试使用最后一种类型的输入,但是这个

You're trying to use the last type of input, but this

arrivemask = (arrivemin < row['dropoff_datetime']) and 
    (row['dropoff_datetime'] < arrivemax)

是标量布尔值,而不是数组.

is scalar boolean, not array.

您无需遍历数据框.熊猫为你做这件事.只需使用:

You need not to iterate through dataframe. Pandas does it for you. Just use:

gametaxidf.loc[
   (arrivemin < gametaxidf['dropoff_datetime'])
   &
   (gametaxidf['dropoff_datetime'] < arrivemax)
   , 'relevant'] = 1

这篇关于在 Pandas 数据帧上使用布尔过滤器时出现 KeyError的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,WP2

admin_action_{$_REQUEST[‘action’]}

do_action( "admin_action_{$_REQUEST[‘action’]}" )动作钩子::在发送“Action”请求变量时激发。Action Hook: Fires when an ‘action’ request variable is sent.目录锚点:#说明#源码说明(Description)钩子名称的动态部分$_REQUEST['action']引用从GET或POST请求派生的操作。源码(Source)更新版本源码位置使用被使用2.6.0 wp-admin/admin.php:...

日期:2020-09-02 17:44:16 浏览:1127

admin_footer-{$GLOBALS[‘hook_suffix’]}

do_action( "admin_footer-{$GLOBALS[‘hook_suffix’]}", string $hook_suffix )操作挂钩:在默认页脚脚本之后打印脚本或数据。Action Hook: Print scripts or data after the default footer scripts.目录锚点:#说明#参数#源码说明(Description)钩子名的动态部分,$GLOBALS['hook_suffix']引用当前页的全局钩子后缀。参数(Parameters)参数类...

日期:2020-09-02 17:44:20 浏览:1033

customize_save_{$this->id_data[‘base’]}

do_action( "customize_save_{$this-&gt;id_data[‘base’]}", WP_Customize_Setting $this )动作钩子::在调用WP_Customize_Setting::save()方法时激发。Action Hook: Fires when the WP_Customize_Setting::save() method is called.目录锚点:#说明#参数#源码说明(Description)钩子名称的动态部分,$this->id_data...

日期:2020-08-15 15:47:24 浏览:775

customize_value_{$this->id_data[‘base’]}

apply_filters( "customize_value_{$this-&gt;id_data[‘base’]}", mixed $default )过滤器::过滤未作为主题模式或选项处理的自定义设置值。Filter Hook: Filter a Customize setting value not handled as a theme_mod or option.目录锚点:#说明#参数#源码说明(Description)钩子名称的动态部分,$this->id_date['base'],指的是设置...

日期:2020-08-15 15:47:24 浏览:866

get_comment_author_url

过滤钩子:过滤评论作者的URL。Filter Hook: Filters the comment author’s URL.目录锚点:#源码源码(Source)更新版本源码位置使用被使用 wp-includes/comment-template.php:32610...

日期:2020-08-10 23:06:14 浏览:903

network_admin_edit_{$_GET[‘action’]}

do_action( "network_admin_edit_{$_GET[‘action’]}" )操作挂钩:启动请求的处理程序操作。Action Hook: Fires the requested handler action.目录锚点:#说明#源码说明(Description)钩子名称的动态部分$u GET['action']引用请求的操作的名称。源码(Source)更新版本源码位置使用被使用3.1.0 wp-admin/network/edit.php:3600...

日期:2020-08-02 09:56:09 浏览:848

network_sites_updated_message_{$_GET[‘updated’]}

apply_filters( "network_sites_updated_message_{$_GET[‘updated’]}", string $msg )筛选器挂钩:在网络管理中筛选特定的非默认站点更新消息。Filter Hook: Filters a specific, non-default site-updated message in the Network admin.目录锚点:#说明#参数#源码说明(Description)钩子名称的动态部分$_GET['updated']引用了非默认的...

日期:2020-08-02 09:56:03 浏览:834

pre_wp_is_site_initialized

过滤器::过滤在访问数据库之前是否初始化站点的检查。Filter Hook: Filters the check for whether a site is initialized before the database is accessed.目录锚点:#源码源码(Source)更新版本源码位置使用被使用 wp-includes/ms-site.php:93910...

日期:2020-07-29 10:15:38 浏览:809

WordPress 的SEO 教学:如何在网站中加入关键字(Meta Keywords)与Meta 描述(Meta Description)?

你想在WordPress 中添加关键字和meta 描述吗?关键字和meta 描述使你能够提高网站的SEO。在本文中,我们将向你展示如何在WordPress 中正确添加关键字和meta 描述。为什么要在WordPress 中添加关键字和Meta 描述?关键字和说明让搜寻引擎更了解您的帖子和页面的内容。关键词是人们寻找您发布的内容时,可能会搜索的重要词语或片语。而Meta Description则是对你的页面和文章的简要描述。如果你想要了解更多关于中继标签的资讯,可以参考Google的说明。Meta 关键字和描...

日期:2020-10-03 21:18:25 浏览:1620

谷歌的SEO是什么

SEO (Search Engine Optimization)中文是搜寻引擎最佳化,意思近于「关键字自然排序」、「网站排名优化」。简言之,SEO是以搜索引擎(如Google、Bing)为曝光媒体的行销手法。例如搜寻「wordpress教学」,会看到本站的「WordPress教学:12个课程…」排行Google第一:关键字:wordpress教学、wordpress课程…若搜寻「网站架设」,则会看到另一个网页排名第1:关键字:网站架设、架站…以上两个网页,每月从搜寻引擎导入自然流量,达2万4千:每月「有机搜...

日期:2020-10-30 17:23:57 浏览:1264