我必须在我的计算机上手动插入一些信息,因此我必须检查所有数据是否输入正确。
我想要的输出应该是这样的:奇数表示一个间隔的开始,偶数表示结束(仍然包括在内)。
2015-03-02 15:00:45,1
2015-03-02 15:05:00,18
2015-03-02 17:00:45,19
2015-03-02 17:04:30,34
2015-03-02 17:44:15,33
2015-03-02 17:46:15,41
2015-03-17 15:00:45,1
2015-03-17 15:05:00,18
2015-03-17 17:00:45,19
2015-03-17 17:04:30,34
2015-03-17 17:44:15,33
2015-03-17 17:46:15,41
使用此方法,我们可以查看数据事务和重新键入是否有效。
我到这里的尝试都不起作用,因为它们没有正确地设置所有断点。
mintime = pd.to_datetime(tiere.loc[(tiere.timestamp.shift(-1)-tiere.timestamp)>"00:01:00","timestamp"].values[0:],format="%Y-%m-%d %H:%M:%S").sort_values()
#add to time max and get unique timestamps and sort them works only if tiere resample is NOT ON!!!
maxtime = pd.to_datetime(tiere.loc[(tiere.timestamp-tiere.timestamp.shift(1))>"00:01:00","timestamp"].values[0:],format="%Y-%m-%d %H:%M:%S").sort_values()
#add to time min and get unique timestamps and sort them. works only if tiere resample is NOT ON!!!
min2 = (pd.to_datetime(tiere.loc[(tiere.timestamp.shift(1)-tiere.timestamp)>"00:01:00","timestamp"].values[0:],format="%Y-%m-%d %H:%M:%S").sort_values())
#add to time max and get unique timestamps and sort them works only if tiere resample is NOT ON!!!
max2 = (pd.to_datetime(tiere.loc[(tiere.timestamp-tiere.timestamp.shift(-1))>"00:01:00","timestamp"].values[0:],format="%Y-%m-%d %H:%M:%S").sort_values())
breakpoints = mintime.union(mintimestamp_tiere).union(min2).union(maxtime).union(maxtimestamp_tiere).union(forgottentimedates).union(max2).delete(7)
相关分类