gr00t Model - phospho Training Pipeline

Error Traceback

We faced an issue while training your model.

Traceback (most recent call last):
  File "/opt/conda/lib/python3.11/asyncio/tasks.py", line 500, in wait_for
    return fut.result()
           ^^^^^^^^^^^^
  File "/root/phosphobot/am/gr00t.py", line 970, in read_output
    async for line in process.stdout:
  File "/opt/conda/lib/python3.11/asyncio/streams.py", line 765, in __anext__
    val = await self.readline()
          ^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/asyncio/streams.py", line 566, in readline
    line = await self.readuntil(sep)
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/asyncio/streams.py", line 658, in readuntil
    await self._wait_for_data('readuntil')
  File "/opt/conda/lib/python3.11/asyncio/streams.py", line 543, in _wait_for_data
    await self._waiter
asyncio.exceptions.CancelledError

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/root/phosphobot/am/gr00t.py", line 981, in run_gr00t_training
    await asyncio.wait_for(read_output(), timeout=timeout_seconds)
  File "/opt/conda/lib/python3.11/asyncio/tasks.py", line 502, in wait_for
    raise exceptions.TimeoutError() from exc
TimeoutError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/root/src/helper.py", line 165, in predict
    trainer.train(timeout_seconds=timeout_seconds)
  File "/root/phosphobot/am/gr00t.py", line 1146, in train
    asyncio.run(
  File "/opt/conda/lib/python3.11/asyncio/runners.py", line 190, in run
    return runner.run(main)
           ^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/asyncio/runners.py", line 118, in run
    return self._loop.run_until_complete(task)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/asyncio/base_events.py", line 654, in run_until_complete
    return future.result()
           ^^^^^^^^^^^^^^^
  File "/root/phosphobot/am/gr00t.py", line 986, in run_gr00t_training
    raise TimeoutError(
TimeoutError: Training process exceeded timeout of 10800 seconds. Please consider lowering the number of epochs and/or batch size.

Training parameters:

๐Ÿ“– Get Started: docs.phospho.ai

๐Ÿค– Get your robot: robots.phospho.ai

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support