简单的 apache beam 管道抛出 TypeError:需要一个整数

我有一个用 python 编写的简单的 apache-beam 管道。


import apache_beam as beam

from apache_beam.runners.interactive.interactive_runner import InteractiveRunner

from apache_beam.runners.direct.direct_runner import DirectRunner

from apache_beam.options.pipeline_options import PipelineOptions


p = beam.Pipeline(InteractiveRunner(underlying_runner=DirectRunner()), options=PipelineOptions())


class Foo(beam.DoFn):

  def process(self, element, *args, **kwargs):

    k,v = element

    yield (k , {k:v})



a = [('a',1), ('a',1), ('b',1)]


x0 = p | "0" >> beam.Create(a) | "2" >> beam.ParDo(Foo())

r = p.run()

r.wait_until_finish()

print(r.get(x0))

运行上面的代码会抛出以下错误


Traceback (most recent call last):

  File "apache_beam/runners/common.py", line 883, in apache_beam.runners.common.DoFnRunner.process

  File "apache_beam/runners/common.py", line 497, in apache_beam.runners.common.SimpleInvoker.invoke_process

  File "apache_beam/runners/common.py", line 1028, in apache_beam.runners.common._OutputProcessor.process_outputs

  File "apache_beam/runners/worker/operations.py", line 177, in apache_beam.runners.worker.operations.SingletonConsumerSet.receive

  File "apache_beam/runners/worker/operations.py", line 155, in apache_beam.runners.worker.operations.ConsumerSet.update_counters_start

  File "apache_beam/runners/worker/opcounters.py", line 214, in apache_beam.runners.worker.opcounters.OperationCounters.update_from

  File "apache_beam/runners/worker/opcounters.py", line 233, in apache_beam.runners.worker.opcounters.OperationCounters.do_sample

  File "apache_beam/coders/coder_impl.py", line 1162, in apache_beam.coders.coder_impl.WindowedValueCoderImpl.get_estimated_size_and_observables

  File "apache_beam/coders/coder_impl.py", line 1172, in apache_beam.coders.coder_impl.WindowedValueCoderImpl.get_estimated_size_and_observables

a但是,如果我们设置为列表列表而不是元组列表,则相同的代码可以工作。


a = [('a',1), ('a',1), ('b',1)]-->a = [['a',1], ['a',1], ['b',1]]


呼唤远方
浏览 151回答 0
0回答

互换的青春

我已经确认这是一个错误。提交https://issues.apache.org/jira/browse/BEAM-10833
打开App,查看更多内容
随时随地看视频慕课网APP

相关分类

Python